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The  first  limited  dependent  variable  model  was  proposed 
by  Tobin.   His  approach  is  now  known  as  Tobit  analysis. 
Tobit  analysis  uses  a  maximiun  likelihood  procedure  to  esti- 
mate models  with  a  truncated  --  that  is,  a  limited  --  depend- 
ent variable  when  a  large  number  of  observations  take  on  the 
truncating  threshold.   Tobin' s  model  has  recently  been  ex- 
tended to  handle  more  complicated  situations.   One  example 
of-  these  extensions  is  a  switching  regression  model  where 
the  switching  function  is  endogeneous  to  the  model.   In  this 
thesis  the  switching  regression  model  is  used  to  study  hous- 
ing demand  equations. 

The  thesis  argues  that  previous  studies  on  housing  only 
emphasize  one  part  of  a  complete  model.-   Some  of  these  past 
studies  only  consider  demand  equations  for  owners  and  renters, 
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while  others  only  consider  th?  consumer's  rent  versus  own 
decision.   The  thesis  proposes  and  estimates  a  complete 
model  which  allows  for  the  simultaneous  determination  of 
whether  or  not  to  own  and  how  much  to  spend.   In  this 
model  estimation  of  the  housing  demand  equations  by  OLS 
will  be  biased  if  there  is  simultaneity  between  these 
equations  and  the  switching  function.   To  avoid  this  bias 
the  parameters  are  estimated  by  a  two  step  technique  and 
a  two  step  maximum  likelihood  procedure.   Both  these  pro- 
cedures are  asymptotically  consistent  and  the  two  step 
procedure  also  is  asymptotically  efficient.   In  order  to 
test  for  simultaneity  the  demand  equations  also  are  esti- 
mated by  ordinary  least  squares  (OLS) .   A  maximum  likeli- 
hood ratio  test  then  is  used  to  compare  the  two  step  and 
OLS  estimates.   The  results  of  this  test  show  that  simul- 
taneity is  present.   Therefore,  the  thesis  concludes  that 
OLS  estimates  of  the  demand  equations  are  biased  and  the 
two  stage  and  two  step  procedures  are  more  appropriate. 
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CHAPTER  I        •  ■ 
INTRODUCTION 
Many  models  of  housing  demand  have  been  presented  in 
the  literature.   The  earlier  models  were  estimated  with 
aggregate  time  series  or  cross-sectional  data.   For  example, 
see  Muth  (60) ,  Reid  (62)  or  Winger  (68) .   More  recent  studies 
such  as  Carliner  (73)  and  Fenton  (74)  used  micro  data  to 
estimate  demand  for  housing  equations. -  In  general  all  these 
past  studies  use  a  least  squares  technique  and  estimate  two 
housing  demand  equations  —  one  for  owners  and  another  for 
renters.   One  purpose  of  this  thesis  is  to  show  that  under 
certain  conditions  these  least  squares  estimates  will  be 
biased.   This  bias  becomes  apparent  once  the  owner  and  renter 
equations  are  viewed  as  limited  dependent  variables"*"  that 
are  determined  simultaneously  with  a  rent  versus  own  choice 
function.   It  is  this  simultaneity  which  causes  the  OLS 
estimates  to  be  biased.   Therefore,  other  unbiased  estima- 
tion procedures  have  to  be  used.   One  such  method  is  a  "two 
stage"  technique  proposed  by  Lee  (76) .   Another  is  the  two 
step  maximum  likelihood  procedure  described  in  Amemiya  (74) . 
Both  procedures  are  used  in  this  thesis  and  the  results  are 
compared  to  the  usual  OLS  approach. 


A  limited  dependent  variable  is  a  dependent  variable  that 
has  a  lower  or  upper  limit,  and  takes  on  the  limiting  value 
for  a  substantial  number  of  respondents.   Models  of  this 
type  have  attracted  great  attention  recently.   For  examples, 
see  Cragg  (71),  Maddala  and  Nelson  (74)  and  Heckman  (74). 


A  second  purpose  of  th.i  s  thesis  is  to  derive  a  model  of 
housing  demand  with  interdependent  choices  about  owning  or 
renting.   To  .accomplish  this  task  housing  is  viev/ed  as  a 
bundle  of  several  goods  such  as  location  in  a  given  area, 
type  of  structure,  size" and  quality  of  neighbors.   In  general 
any  given  bundle  will  have  one  price  if  it  is  owned  and 
another  price  if  rented.   Following  this  line  of  thought, 
some  families  will  be  better  off  renting  while  others  will 
prefer  to  buy  their  own  home.   The  model  derived  in  this 
thesis  assumes  that  this  rent  versus  own  choice  function  and 
the  expenditure  equations  depend  on  certain  independent 
variables  as  well  as  stochastic  disturbances  that  are  not 
necessarily  independent.   These  variables  include  a  measure 
of  permanent  income;  personal  characteristics  of  the  family 
head  such  as  age,  race,  and  sex;  family  background  variables 
such  as  mobility  and  family  size;  various  price  indices;  and 
regional  variables  to  capture  the  effect  of  varying  land 
values. in  different  parts  of  the  U.S. 

In  Chapter  II  past  housing  studies  are  critiqued. 
Chapter  III  presents  the  formal  model.   In  Chapter  IV  con- 
sistent estimates  of  the  rent  and  ovm  expenditure  equations 
are  obtained  by  the  two  stage  method.   The  choice  function 
is  estimated  by  Probit  analysis.   Chapter  V  estimates  the 
model  with  a  two  step  maximum  likelihood  procedure  and  tests 
for  simultaneity  with  a  maximum  likelihood  ratio  test. 
Chapter  VI  contains  the  conclusions.   Finally,  the  appendices 
contain  a  discussion  of  the  data  and  an  extension  of  the 
model  to  the  multi-period  case. 


CHAPTER  II 
A  CRITIQUE  OF  PREVIOUS  STUDIES 
In  the  past  fifteen  years  numerous  studies  of  the  house- 
liold's  demand  and  choice  of  housinq  have  been  presented  in 
the  literature.   The  earlier  papers  studied  macro  data  such 
as  aggregate  U.S.  time  series  data  or  the  average  income  and 
housing  expenses  of  given  SMSA's  and  cities.   The  main  con- 
cern of  these  studies  was  to  obtain  estimates  of  the  income 
elasticity  demand  for  housing.   In  general  these  elasticity 
estimates  were  greater  tlian  one.   For  example,  see  Muth  (60), 
Reid  (62)  or  Winger  (68).   Muth's  (60)  study  uses  time  series 
data  and,  in  a  separate  analysis,  he  also  uses  cross-sectional 
data.   He  concludes  that  the  income  elasticity  for  desired 
liousing  stock  is  at  least  1.0.   In  a  reviev;  article,  de  Leeuw 
(71)  makes  adjustments   in  the  Muth  (60),  Reid  (62), and  VUnger 
(68)  estimates.   He  concludes  that  the  income  elasticity  of 
rental  housing  in  the  United  States  is  between  .8  and  1.0. 
A  second  conclusion  is  that  the  income  elasticity  for  owner- 
occupied  housing  is  probably  higher  than  that  for  rental 
housing.   However,  even  de  Leeuw 's  results  are  quite  high 
when  compared  to  the  estimates  obtained  with  micro  data. 
For  example,  using  micro  data  Carliner's  (73)  estimates  were 


^['hese  "adjustments"  stem  from  the  fact  that  past  studies  did 
not  use  the  same  data  base  or  the  same  definition  of  variables 
when  estimating  the  elasticities.   For  example,  some  did  not 
include  imputed  rent  in  the  definition  of  income  for  homeowners, 


.631  for  ov/ners  and  .520  for  renters.   Also  using  micro' data, 
Fenton  (74)  estimated  income  elasticities  of  .41,  .46  and  .46 
for  all  renter  households,  and  high  and  low  income  renter 
households,  respectively.   Polinsky  (75)  concludes  that  these 

past  macro  results  are  biased  upward  and  the  micro  results 

2 
biased  downward.    He  then  states  that  the  true  income  elas- 
ticity is  around  .75  and  the  true  price  elasticity  is  approx- 
imately -.75.   \^lhile  Polinsky' s  criticisms  deal  with  the 
specification  of  the  price  variable,  the  critique  in  this 
chapter  is  aimed  at  the  technique  used' to  estimate  these  past 
models.   Since  a  typical  sample  is  comprised  of  families  who 
either  own  their  home  or  rent,  these  models  can  be  divided 
into  one  of  three  groups. 

A  naive  approach  is  to  divide  the  sample  data  into 
owners  and  renters  and  estimate  each  subgroup  separately. 
This  method  results  in  two  unique  expenditure  equations: 
one  for  homeowners,  estimated  without  using  the  information 
in  the  renter  subsample;  another  for  renters,  estimated 
without  using  the  information  contained  in  the  owner  sub- 
sample.   For  examples  see  Lee  (68) ,  Ohls  (7l) ,  Carliner  (73) 
or  Fenton  (74). 


2 

Polinsky  argues  that  the  omission  or  misspecif ication  of  the 

price  term  biases  the  ungroups  (i.e.  micro)  estimates  downward 
and  the  groups  (i.e.  macro)  estimates  upward. 

3 
Pxatlier  than  divide  the  sample  into  owners  and  renters  one  can 

assume  that  all  or  part  of  the  parameters  in  the  two  expendi- 
ture equations  are  the  same  for  owners  and  renters.   A  single 
equation  can  then  be  estimated  from  the  entire  sample.   Dummy 
variables  are  used  on  the  subset  of  parameters  assumed  differ- 
ent for. owners  and  renters. 


The  attractiveiiess  of  t'le  above  approach  is  that  an 
ordinary  least  squares  estimation  technique  (OLS)  can  be 
used.   A  second  alternative,  the  Tobit  model,  requires 
maximum  likelihood  estimation.   This  method,  first  des- 
cribed by  Tobin  (58),  results  in  two    unique  demand  equa- 
tions.  It  differs  from  the  first  approach  in  that  each 
equation  is  estimated  using  all  the  sample  information. 
This  is  done  by  assigning  a  zero  value  to  the  renters 
(owners)  when  estimating  tlie  demand  equation  for  owners 
(renters) .   In  his  paper  Tobin  examines  buyers  and  non- 
buyers  of  a  durable  good  in  any  given  year.   He  shows 
that  OLS  on  the  subset  of  buyers  is  both  biased  and  in- 
efficient.  Ke  then  suggests  a  maximum  likelihood  techni- 
que (Tobit)  that  is  both  unbiased  and  efficient.- 

In  these  first  two  approaches  either  housing  demand 
or  expenditure  equations  are  estimated.   The  dependent 
variable  is  either  the  homogeneous  comniodity  called  "housing 
services"  discussed  in  Olsen  (69)  or  various  Lancastrian 
"housing  characteristics"  described  in  King  (76). 

A  tliird  approach  is  to  only  model  the  choice  of  whether 
to  buy  or  rent.   This  can  be  accomplished  with  a  linear  pro- 
bability model,  a  Probit  model  or  a  Logit  model.   With  this 
method  one  does  not  obtain  demand  equations  but  rather  an 
equation  to  predict  the  probability  that  a  given  fam.ily 
will  own  their  own  home.   Examples  of  the  Probit  model  can 
be  found  in  Ohls  (71)  and  Poirier  (74)  .   Quigley  (76)  , 
in  a  similar  analysis,  uses  the  Logit  model  to  estimate  the 
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probability  that  a  given  fa.nUy  will  choose  among  18  types 
of  residential  housing.   The  coefficients  obtained  by  the'se 
models  are  estimates  of  the  true  coefficients  in  the  choice 
equation  divided  by  a  common  scale  factor.   The  coeffici- 
ents can  only  be  estimated  up  to  a  scale  factor  because  the 
dependent  variable  is  never  observed  and  therefore  the 
variance  of  the  disturbance  term  cannot  be  estimated.   For 
example,  if  the  dependent  variable  is  the  utility  obtain- 
able when  a  family  ov/ns  less  the  utility  obtainable  when 
that  same  family  rents,  this  difference  of  utility  is  never 
observed.   Ratlier,  we  only  observe  a  one  (ov;ning)  when  this 
difference  is  positive  and  a  zero  (renting)  when  this  differ- 
ence is  negative.   The  Probit  and  Logit  models  use  a  maximum 
likelihood  technique  to  estimate  the  coefficients.   These 
coefficients  are  then  used  to  assign  each  family  with  a 
number  betv;een  zero  and  one.   This  number  can  be  interpreted 
as  the  probability  that  a  given  family  will  own  their  own  home. 
For  Probit  Analysis  this  probability  is  derived  by  first 
multiplying  the  independent  variables  by  their  respective 
coefficients.   The  scalar  obtained  from  this  multiplication 
is  a  standard  normal  variate.   The  probability  of  owning  is 
simply  the  Standard  Normal  Cumulative  function  evaluated 
atthisscalar. 

While  numerous  examples  of  the  above  alternatives  can 
be  cited  in  the  literature,  no  one  has  studied  models  where 
the  decision  on  how  much  to  spend  is  determined  simultaneously 
with  the  rent-own  decision.   An  approach  designed  to  fill  this 
gap  in  the  literature  is  derived  in  the  following  chapter. 


CHAPTER  III 

A  MODEL  OF  INTERDEPENDENT  CHOICES 

1.   An  Interdopondent  Choice  Model  Based  on  Utility 
Maximization 

The  major  focal  point  of  this  section  is  to  sort  out 
the  factors  that  determine  why  some  families  ov;n  their  own 
home  and  others  rent.   While  many  speci-fic  reasons--such 
as  investment  criteria  or  simple  Tobit  analysis  --for  buying 
a  home  can  be  given,  the  choice  mechanism  presented  here  is 
based  on  the  simple  economic  principles  of  utility  analysis. 
The  hope  is  to  have  a  general  and  widely  applicable  model. 

The  model  will  view  the  decision  to  procure  shelter  in 
an  imperfect  world.   First,  the  choice  of  a  family  to  rent 
or  buy  in  a  particular  neighborhood  is  limited  to  an  "either- 
or"  situation.   In  words,  a  family  must  choose  between  a 
number  of  discrete  alternatives  available.   That  is,  given 
price  and  nonprice  constraints  some  alternatives  may  not  be 
feasible;  for  example,  renting  an  apartment  with  a  large 
garden.   Second,  individual  families  face  a  number  of  con- 
straints such  as  budget  and  time  constraints.   Financial 


In  Tobin's  (58)  pioneering  paper  on  limited  dependent  vari- 
ables he  presents  an  estimation  technique  for  models  v^7hen 
the  dependent  variable  often  takes  on  a  lower  or  upper  limit. 
His  method  is  now  known  as  Tobit  analysis  and  has  many  appli- 
cations in  economics.   For  example,  a  family's  expenditure  on 
a  major  durable  such  as  a  house  may  be  zero  until  the  house- 
hold income  exceeds  a  certain  level. 


institutions  constrain  families  to  loans  of  limited  size 
by  requiring,  for  example,  that  housing  expenditures  not 
exceed  20-25  percent  of  income.   Third,  families  purchase 
both  durables  and  nondurables,  consuming  durables  over 
time.   Fourth,  housing  is  not  a  homogeneous  commodity. 
Rather,  v/hat  we  call  housing  is  really  a  bundle  of  several 
goods:   location,  house  size,  yard  size,  type  of  structure, 
central  air-conditioning  and ' even  type  of  neighbors.   To 
get  around  this  nonhomogeneity  problem  Olsen  (69)  defines 
a  homogeneous  commodity  which  he  calls '"housing  services." 
Olsen  argues  that  v/henever  the  housing  market  is  out  of 
equilibrium  the  existing  housing  stock  would  filter  (i.e. 
and  increase  or  decrease  in  the  quantity  of  housing 
services)  up  or  dovm  and  neis?  dwelling  units  would  be  con- 
structed.  Disequilibrium  is  said  to  exist  whenever  price 
is  greater  than  long  run  average  cost.   This  filtering  and 
construction  continues  until  there  are  no  profits  to  be 
made  on  bundles  of  housing  of  any  size.   This  zero  profit 
equilibrium  position  requires  the  price  per  unit  of  housing 
services  for  bundles  of  all  sizes  to  be  the  same.  -  Follow- 
ing this  line  of  thought,  if  two  families  in  an  equilibrium 
market  spend  different  amounts  on  housing,  the  implication 
is  that  both  fantilies  are  consuming  different  quantities 
of  the  same  good--"housing  services."   This  quantity  com- 
parison is  possible  because  of  two  crucial  assumptions. 
First,  it  assumes  that  the  equilibrium  price  per  unit  of 
"housing  services"  has  been  reached.   Second,  it  assumes 
that  families  do  not  have  preferences  for  particular  types 


of  "housing  services"  such  a?  living  in  downtown  New  York 
City  versus  living  in  New  York  City  suburbs. 

A  Lancastrian  approach  to  the  nonhomogeneity  problem 
is  taken  by  King  (76) .   He  postulates  that  housing  is 
really  made  up  of  several  hon'ogeneous  characteristics. 
An  individual  has  separate  demands  for  each  of  these 
characteristics  but  not  a  demand  for' housing  per  se. 

■  In  order  to  distinguish  between  the  demand  for  renting-- 
which  was  primarily  the  demand  for  apartments  in  1971--and 
the  demand  for  houses,  a  slightly  different  ^approach  to  the 
nonhomogeneity  issue  is  taken  here.   For  simplicity  this 
approach  is  presented  as  a  one  period  maximization  problern. 
However,  the  model  is  general  enough  to  be  easily  extended 
to  the  multi-period  case.   For  example,  one  could  assume 
the  family  maximizes  expected  utility  summed  over  the 
entire  occupancy  of  a  dwelling.   A  multi-period  model  taking 
this  apiproach  is  presented  in  the  Appendix. 

Assume  for  the  moment  that  housing  merely  amounts  to 
the  size  of  the  dwelling.   The  larger  the  size,  the  greater 
is  the  quantity  of  housing  consumed.   In  this  simple  case 
an  ovmed  home  and  a  rented  apartment  are  perfect  substitutes. 
Indeed,  they  are  the  same  commodity  and  a  family's  decision 
to  own  or  rent  will  bo  based  on  price  alone.   From  consump- 
tion theory  we  know  that  in  equilibrium  the  family  will 
equate  the  ratj.os  of  marginal  utility  to  the  ratios  of 
marginal  cost  for  all  goods.   That  is,  if  the  family  owns 

M.HH)  _  ^_1V^ 
MU(X)       P^ 
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will  hold,  where  MU  is  margiiAal  utility,  MC   is  a  marginal 
cost  function  for  owning,  H  is  the  size  of  the  owned  home, 
X  represents  all  other  goods  and  P   is  the  price  of  all 
other  goods.   Similarly,  if-- the  family  rents  H  units  of 
housing 

MU(II)  _"^^C^(") 


MU(X)    ,   P 
will  hold,  v;here  MC   is  a  marginal  cost  function  for  renting. 
If  MC   and  MC   are  different  functions',  then  given  prices, 

or  '  -3  f  r 

preferences  and  income,  the  family  will  rent  or  ov;n  depend- 
ing  upon  which  is  cheaper."   For  example,  consider  the  partial 
equilibrium  analysis  depicted  in  Figure  1.   In  this  case  we 
would  observe  family  1  renting  H,  and  family  2  owning  H- .   In 
general  the  shape  of  these  marginal  cost  functions  would  de- 
pend on  the  location  of  the  structure.   A  family  in  New  York 
City  with  high  land  values  would  therefore  face  a  different 
marginal  cost  pattern  than  a  family  in  rural  Alabama  with 
relatively  low  land  values. 


Wliile  in  general  the  choice  between  two  perfect  substitutes 
results  in  an  either-or  situation  there  is  one  exception.   If 
the  marginal  rate  of  substitution  and  the  price  ratio  have 
identical  slopes,  then  the  consumer  is  indifferent  between 
any  combination  of  the  two  conimodities .   Unless  the  individual 
could  rent  three  rooms  of  a  five  room  house  and  own  the  other 
two,  this  would  not  be  the  case  for  housing.   For  most  con- 
sumers the  implicit  time  cost  of  consuming  two  separated  hous- 
ing units  constrains  them  to  an  either-or  choice.   Of  course 
there  may  be  situations  where  a  family  would  simultaneously 
own  and  rent.   One  example  is  the  businessman  who  owns  a  house 
in  tl"\e  suburbs  but  also  rents  an  apartment  near  work.   For  him 
the  opportunity  cost  of  commuting  every  day  outweighs  the  cost 
of  renting  an  apartment. 
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Now  consider  housing--or  if  you  like,  housing  services-- 
as  being  made  up  of  several  commodities.   Let  this  set  H  be 
represented  by  a  vector  of  n  different  housing  service  com- 
modities such  that  H  =  [h.]->  i=l  .  .  .  n,  where  h.  is  one 
element  of  the  vector  H.   Next,  let  the  set  of  prices  of 
tliese  n  commodities  be  vector  functions  of  h..   Call  this 
set  R  for  renting  and  0  for  owning  such  that 
R(I1)  =  [r^(h.)  ]  ,"i  =  1  }  .  .  n 
0(11)  -  [o^  (h^)  ]  ,  i  =  1  .  .  .  n 

R'  (H)  =  [r  '  (h.  )  ]  ,  i  =  1  .  .  ".  n 

0- (H)  =  [o|(h^) ] ,  i  =  1  .  .  .  n 
where  R'  (II)  and  O'  (H)  are  vectors  bf  marginal  cost  functions. 
At  any  point  in  time  a  family  can  consume  all  the  h. 's  or 
only  a  subset  of  them.   Utility  theory  states  that  in  equili- 
brium the, family  will  equate  the  ratios  of  marginal  utility 
to  marginal  cost  for  all  goods.   Utility  theory  also  tells  us 
that  each  family  v;ill  liave  demand  equations  for  all  the  h. 
goods.   As  shown  in  the  partial  equilibrium  diagrams  of 
Figure  1,  v;hcthcr  they  own  or  rent  these  goods  will  depend 
on  the  marginal  cost  and  marginal  utility  functions.   But 
at  least  four  problems  arise  when  it  comes  to  estimating 
these  separate  demand  functions.   First,  a  family  generally  , 
does  not  own  housing  commodity  h.  and  rent  housing  commodity 
h..   For  example,  it  would  be  very  difficult  cost  v\?ise  for 
a.  family  to  rent  a  lot  witli  a  nice  location  and  then  build  a 
dream  house  which  they  plan  on  taking  v;ith  them  when  they 
move.   Second,  most  or  all  of  the  h.'s  are  either  unobserv- 
able  or  difficult  to  measure.   Indeed,  many  of  the  h.'s 
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MC  (H) 
'       /'     MCn(H) 


AC  (H) 


House  Size  (H) 


Figure  1.   Partial  equilibrium  housing  demand 
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may  be  subjective  qualities  unique  to  every  family.   Third, 
because  housing  for  any  famij.y  may  only  contain  a  subset 
of  the  h.'s,  we  have  a  limited  dependent  variable  problem. 
Tliat  is,  for  any  h.  we  will.Konly  observe  positive  quanti- 
ties being  consumed  by  some  of  the  families.   For  the  rest 
of  the  families  we  will  observe  some  limited  quantity  being 
consumed.   In  general  this  limited  quantity  is  zero.   Fourth, 

the  price  functions  o.(h,)  a'nd  r.(h"^)  are  hard  if  not  im- 

'^  11        11  .- 

possible  to  observe. 

Ideally  of  course  we  should  estimate  demand  equations 
for  all  tlie  h.  's,  but  given  the  four  problems  just  discussed, 
other  approaches  must  be  substituted.   One  method  is  to 
reduce  housing  to  a  few  reasonable  and  measurable  Lancas- 
train  characteristics  and  estimate  demand  equations  for  each 
of  these  characteristics.   This  is  the  approach  taken  by 
King  (76).   Another  alternative  is  to  simply  define  a  homo- 
geneous good  called  "housing  services."   As  discussed  in 
Olsen  (69)  the  problem  now  reduces  to  the  estimation  of  a 
single  demand  equation.   VJhile  both  of  these  approaches 
have  merit,  they  give  little  insight  into  the  rent-own 
decision.   The  analysis  to  follow  combines  these  two  ap- 
proaches.  The  result  is  a  model  i\;here  the  expenditure 
decision  is  determined  simultaneously  with  the  rent-own 
decision . 

Keeping  in  mind  the  fact  that  families  face  time, 
financial,  and  budget  constraints,  the  model  can  be  stated 
as  follows.   The  family  can  choose  to  purchase  or  rent 
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some  amount  of  the  homogeneous  housing  commodity  H.*. 
Here  H.*  is  simply  one  of  the  many  possible  subsets  of 
n  such  that  I!.*  =  F  .  (II)  ,  j  =  l  .  .  .  m.   Here  F.(-)  is  some 
function.   More  precisely,  U  ■*    should  be  viewed  as  a  homo- 
geneous subset  of  H  v/ith  the  remaining  m-j  commodities 
held  constant  at  some  level.    This  fixed  level  can  be  zero 
or  any  positive  number.   The  idea  here  is  to  break  housing 
up  into  several  types  much  like  one'^can  break  fruit  up  into 
apples,  oranges,  and  pears.   For  example,  you  often  hear 
a  family  say  they  are  looking  for  a  "big  house  in  the  city," 
or  "a  ranch  house  in  the  suburbs,"  or  "a  split  level  just 
outside  the  city  with  a  pool,"  or  "a  two  bedroom  apartment 
near  work,"  or  perhaps  "just  a  one  bedroom  unit  near  a 
shopping  center."   After  pricing  these  specific  types  of 
housing,  the  family  may  or  may  not  settle  on  their  first 
choice.   They  will  of  course  settle  for  some  unique  com- 
bination of  the  set  H.   The  simplifying  argument  made  in 
this  paper  is  that  we  can  view  this  decision  as  a  choice 
of  one  type  of  housing  from  a  set  of  several  alternatives. 
The  per  unit  cost  of  buying  H.*  is  represented  by  0.*(H.*) 
such  that 

O.MH.*)  =  f^[0(H)  ]  ,  j  =  1  .  .  .  m, 
where  f   is  some  function.   Similarly,  the  per  unit  cost  of 
renting  H.*  is  given  by  R.*  such  that 

R.*  (H.M  =  fp[R(H)  ]  ,  j  =  1  .  .  .  m, 
3         3  '^ 

where  f   is  some  function.   If  we  ignore  for  the  moment 


3 
If  H*  =  H,  then  this  model  reduces  to  the  one  discussed 

in  Ol^en  (69) . 
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the  possible  tax  reductions  from  owning,  then  if  the  family- 
were  to  purchase  housing  the  maximum  utility  obtained  would 
be  derived  from 

(1)  Max"  (JJ  (X,  H  .*) 
Subject  to  F°(X,  H.*),  j  =  1  .  .  .  m,  where  X  is  a 

bundle  of  all  other  goods  including  saving,  F°(X,  H.*)  is 

« 

the  "owning  consumption  possibility  frontier."   In  this 

simple  case  F°(X,  .H_.  *)  is  given  by  "^ 

F°  (X,  n.*)  =  Y-P  (X)X-O.*  (H  .*)H  .*,  . 
3  J    1    3 

where  0.*(II.*)  includes  any  financial  payments  or  opport- 
unity costs  and  H.*  only  includes  that  part  of  the  dur- 
able good  bought  for  consumption  purposes.   That  part  of 

II.*  purchased  as  an  investment  is  included  in  X.   Y  is 
3       ^     ■ 

income  in  period  t.   The  first  order  conditions  to  (1)  are 

U       F° 
(2)    ^       ^^ 


IJ  p  o       / 

-^H  .*       H  .*  ■■ 

^         3      .  ■    . 

where  U  ,  U,,  ,  ,  F°  ,  and  F°,,  .  are  partial  derivatives.   U  , 
X    11  .  *     X         }{  ■  *  X 

3  3 

U,,  J.,  F°  ,  and  F°,,  ,  are  nothing  more  than  the  marginal 
H  .  '^      X  H  .  '^ 

3  3 

utility  of  X,  the  marginal  utility  of  H.*,  the  marginal  cost 

of  X  and  the  marginal  cost  of  buying  H.*,  respectively.   When 

(2)  holds  let  the  utility  obtained  be  represented  by  U^(H.*), 

v/hore  j  =  1  .  .  .  m.   Assume  that  when  owned  the  Kth  subset 

of  H  produces  the  Iiighest  level  of  utility.   Let  this  utility 

level  be  represented  by  U  *(FI,*). 

A  similar  analysis  ho.lds  for  renting.   If  the  family 

rents  the  subset  H.  *  the  maximum  utility  would  be  derived 

3 

from  (3) 
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(3)  Max  U(X,  H  .*) 
Subject  to:   F^'(X,  H  .*)  , 

where  F  (X,  It.*)  is  the  "renting  consumption  possibility 

frontier"  and  is  given  by  -i^  • 

F^(X,  H.*)  =  Y-P(X)X-R.*  (H  ,*)  II  .*. 
3  .      D    D    J 

The  first  order  conditions  to' (3)  are 

U      *  F^       • 

(4)  ^  =-V^'  ^ 

1  U  .*  ■ 

:) 

n 

where  F    ^  is  the  marginal  cost  of  rerjting  H.*.   When  (4) 

J  ■* 

holds  let  the  utility  obtained  by  represented  by  U  (H.*), 

^   3 

where  j  =  1  .  .  .  m.   Assume  that  the  Lth  subset  of  H  pro- 
duces the  highest  level  of  utility  by  renting.   Let  this 
utility  level  be  represented  by  U  *(H  *) .   Figure  2  shows 

K     Jj 

tlie  equilibrium  positions  given  by  U  *(H,*)  and  Un*(H^*). 

OK  K     Jj 

The  family  will  buy  the  subset  of  housing  services  H,  * 

if  U  *{H,*)  is  larger  than  U_*(H  *).   Conversely,  the  family 

O       Iv  K       Li 

will  rent  the  subset  of  housing  services  H^ *  if  U  *(H  *)  is 
greater  than  U  *(II,*).   If  one  includes  the  tax  advantages 

o     1^ 
from  owning,  then  the  "ov/ning  consumption  possibility 

frontier"  constraint  reduces  to 

F°(X,  H.*)  =  (Y-rM)  (1-t) -P{X)X-0.*  (H.  *)H  .*, 

where  r  is  the  interest  rate  on  the  mortgage,  M  is  the  amount 

A 
of  tlie  mortgage  outstanding  and  t  is  the  tax  rate.  '   In 

general  home  owning  will  now  be  more  attractive  (the  frontier 

for  owning  will  slope  more  to  the  right  in  Figure  2),  but  the 

analysis  will  be  the  same  as  before. 


4 
t  could,  of  course,  be  made  a  function  of  income, 
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X 


K^^'^'K^  =  <-(V 


x^ 


>  H, 


FIGURE  2.   Consumption  possibility  curves 


Up  till  now  I  have  proceeded  as  if  the  quantities  .H,  * 
and  iT  *  can  be  observed  and  measured.   Indeed,  I  have  pro- 
ceeded as  if  the  whole  set  of  housing  types  I!.*  can  be  ob- 
served and  measured.   In  pr^^ctice,  both  observation  and 
measurement  are  difficult.   To  get  around  this  difficulty, 
housing  can  be  measured  in  units  of  dollars.   These  dollar 
units  can  then  be  deflated  by  regional  price  indices.   The 
dependent  variable  ^or  both  'Owners  Snd  renters  now  reduces 
to  deflated  dollar  expenditures.   This  view  of  housing  is 
easily  reconciled  with  consumption  theory.   The  consumer 
will  maximize  utility  by  equating  the  marginal  utility  of 
the  last  dollar  spent  on  housing  to  the  marginal  utility 
of  the  last  dollar  spent  on  all  other  goods.   More  speci- 
fically, the  model  developed  in  this  chapter  can  be  written 
as 

(5)Y-,   =X,  B.+U, 
In     In  1     In 

(6)  Y^   =  X^  Bo  +  U^ 
^     2n     2n  2     2n 

(7)I*  =  Zy-U, 
n      n '     n 

where  Y,   is  real  annual  expenditures  on  housing  if  the  family 

In  '^ 

owns,  Y^   is  real  annual  expenditures  on  housing  if  the  family 

rents,  I  *  is  an  unobservable  index,  X,  ,  X-,   and  Z   are  vectors 
n  '   In'   2n       n 

of  independent  variables  and  {U -,     ,    IK     ,  U  )  are  trivariate 
'^  In    2n    n 

normally  distributed.   Also,  the  model  assumes  Y,   and  Y„ 
are  mutually  exclusive  and  cannot  be  observed  simultaneously 
for  any  one  individual.   One  either  observes  the  f£imily  own- 
ing and  spending  Y,   if  I  *  >  0,  or  renting  and  spending  Y~ 
-^       '-      -^   In      n   —  -'       ^  -^   2n 

if  I  *  <  0.   More  formally,  this  is  a  'sv/itching  regression 
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model  with  sample  separation  and  can  be  written  as 

(8)  Y,   =  X,  6.  +  U 
In     In  I     n 

{9)    Y^      =    X^    Bn    +-    U 
2n     2n-^2     n 

but  we  only  observe  Y  ,  whelre 
-^  n 

Y  =  Yt   iff  Z  Y  >  U  '    • 
n     In       n '  —  n 

(10) 

Y  =  Y-   iff  Z  Y  <  U 
n     2n  ^     n      n 

This  switching  regression  model  is  nothing  more  than  an 
extension  of  Tobin's  (58)  paper.   In  Tobin's  paper  the 
dependent  variable  is  zero  or  some  constant  if  a  family  does 
not  have  expenditures  on  a  durable  good.   In  this  paper  the 
dependent  variable  is  rent  expenditures--a  random  variable-- 
for  families  who  do  not  own. 

In  this  section  I  have  described  a  switching  regression 
model  with  sample  separation  v;here  the  expenditure  equations 
and  choice  function  are  simultaneously  determined.   Although 
the  model  itself  is  simple,  its  estimation  is  not  straight- 
forward.  The  estimation  problems  stem  from  the  fact  that  U 

is  assumed  to  be  correlated  with  U,   and  U^  . 

In       2n 

2 .   Specification  of  the  Model 

The  model  was  estimated  on  a  sample  of  3,028  families 
from  A  Panel  Study  of  Income  Dynamics,  Volume  II  (72), 
(henceforth  tiie  "Panel").   These  data  include  voluminous 
questionnaire  data  for  a  sample  of  3,452  American  families 
over  a  period  of  five  years.   A  detailed  explanation  of 
this  data  base  is  presented  in  the  Appendix. 

For  empirical  purposes,  a  logarithmic  functional  form 
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V7as  used  in  the  demand  equations  and  the  choice  equation. 
This  makes  the  estimated  demand  equations  compatible  with 
previous  studios. 

The  dependent  variable  .was  housing  expenditures-- 
described  as  liousing  cost  in  the  Panel--divided  by  a  price 
of  housing  index.   This  variable  includes  utility  payments, 
amount  saved  on  additions  and  repairs. when  the  work  was 
done  by  a  family  member,   property  t^xes  for  homeowners, 
6%    of  house  value  for  homeowners  and  annual  rent  payments 
for  families  who  rent.   That  is,  the  department  variable 
is  units  of  housing  as  measured  in  real  annual  dollar 
expenditures.  '* 

The  primary  independent  variable   was  a  measure  of 
permanent  income.   The  variable  used  was  a  five  year 
average  of  "family  money"--as  described  in  the  Panel--plus 
inputed  rental  income  for  homeowners  (i.e.  6%  of  net  equity), 
all  divided  by  a  general  price  index.   This  variable  in- 
cludes labor  income  of  head  and/or  wife;  asset  income  from 
farm  or  business;  rental,  interest  and  dividend  income; 
and  transfer  payments  such  as  Aid  to  Dependent  Children. 

Other  independent  variables  included  city  population 
and  distance  from  the  center  of  the  nearest  city  of  50,000 


"Amount  saved  on  additions  and  repairs  was  included  to  make 
housing  expenditures  for  renters  and  ovmers  comparable.   In 
general-  very  few  renters  do  their  own  maintenance-  work. 
T'nis  is  because  their  rental  payments--and  therefore  housing 
expenditures--include  an  implicit  maintenance  cost. 
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population  or  more.    These  two  variables  were  included  to 
capture  the  effect  of  higher  land  values  as  land  becomes 
relatively  scarce  in  densely  populated  areas.   Also,  to 
capture  demographic  differances,  family  size  was  included 
as  an  independent  variable.   This  variable  is  defined  as 
a  number  of  people  (children  plus  adults)  living  in  the 
family  unit.  ■  • 

Dummy  variables  for  age',  sex,  ^nd  race  of  the  family 
head  were  included,  as-  was  a  dummy  variable  if  the  family 
moved  more  than  once  between  196  8  and  1972.   The  reason 
for  including  this  last  variable  in  the  expenditure  equa- 
tions was  to  capture  the  different^  search  costs  as  well  as 
different  demographic  factors  of  the  mobile  families.   It 
was  included  in  the  choice  equation  for  other  reasons. 
The  moving  family,  because  of  the  transactions  cost  in- 
volved in  buying  a  home,  is  more  likely  to  rent  than  ov;n . 

.  Finally,  six  price  indices  were  included.   Each  house- 
hold has  associated  with  it  a  relative  (to  all  other  goods) 
price  of  renting,  a  relative  (to  all  other  goods)  price  of 
owning,  a  price  of  renting,  a  price  of  owning,  a  price  for 
all  goods  other  than  housing  if  the  family,  rents  and  a 
price  for  all  goods  other  than  housing  if  the  family  owns. 
These  price  indices  were  based  on  data  taken  from  the 


In  the  Panel  those  tv;o  variables  were  coded  in  groups.   For 
example,  city  size  was  classified  as  being  in  one  of  six 
groups:   greater  than  500,000;  100,000  to  499,999;  etc.   To 
make  the  variable  continuous,  a  real  number-- the  midpoint, 
when  appropriate--was  used  for  each  group.   That  is,  for 
the  two  above  examples  the  family  was  assigned  a  value  of 
1,000,000  or  300,000,  respectively. 
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Bureau  of  Labor  Statistics  flU,S)  Handbook  of  Labor  Statistics, 


1972.   The  three  "renting"  price  indices  were  constructed 
from  Table  136  in  the  BLS  -  "Annual  Budgets  at  a  Lower  Level 
of  Living  for  a  4-person  Fa-mily,  Autumn  1971."   This  table 
breaks  the  budget  into  nine  categories  for  forty-four  cities 
and  nonme hropolitan  areas  in  the  United  States.   The  housing 
expenditure  data  assumed  all  families  were  renters  and  was 
used  to  construct  the  rent  index.  'fhe   price  index  for  all 
other  goods  was  constructed  by  subtracting  the  housing  expen- 
ditures from  the  total  budget.   Similarly,  the  three  "owning" 
price  indices  were  constructed  from  Table  138  in  the  BLS  - 
"Annual  Budgets  at  a  Higher  Level  of  Living  for  a  4-person 
Family,  Autumn  1971."   In  this  table  the  housing  expendi- 
ture data  were  weighted  by  the  following  proportions:   15 
percent  for  rental  cost  and  85  percent  for  homeowners  cost. 

Using  these  two  BLS  tables,  it  was  therefore  possible 
to  assign  one  price  index  for  renting  and  another  for  own- 
ing to  each  of  the  forty-four  areas.   Depending  on  v;here 
they  lived,  each  family  in  the  Panel  could  then  be  given 
three  price  indices  for  owning  and  three  for  renting.   For 
example,  if  a  family  lived  in  the  Boston  SMSA  the  price 
indices  for  Boston  were  used.   If  a  family  lived  in  the 
Nortlieast  and  in  a  city  of  less  than  50  thousand  popula- 
tion, then  the  price  indices  for  nonmetropolitan  areas  in 
tlie  Northeast  were  used.   A  similar  procedure  was  followed 
for  other  parts  of  the  country. 

Altliough  the  above  price  indices  made  careful  use  of 
all  available  data,  at  least  five  cautions  should  be  made. 
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First,  section  one  of  this  chapter  argues  that  housing  is 
really  a  bundle  of  several  goods.   One  price  index  is  only 
an  approximation  for  several  price  indices  and  therefore 
its  coefficient  is  only  a  surrogate  for  several  price 
elasticities.   Second,  Polinsky  (75)  notes  that  even  if 

housing  is  a  homogeneous  good,  using  metro  housing  price 

« 

indices  ignores  price  variations  among  observations  from 

a  given  metropolitan  area.   T^his  car?  cause  downward  biased 
estimates  of  income  elasticity  and  upward  (tov/ard  zero) 
biased  estimates  of  price  elasticity.  "Third,  the  price 
index  for  ov;nina  vjas  based  on  1971  data.   Unless  relative 
prices  across  cities  remain  constant  over  time,  the  nrice 
index  for  owners  is  only  valid  for  families  who  bought 
their  homes  in  1971.   Fourth,  since  the  rent  index  was 
based  on  lower  income  families,  it  may  not  reflect  the 
true  price  of  renting  for  higher  income  families.   Simi- 
larly, the  owning  price  index  may  not  reflect  the  true 
price  of  owning  for  lower  income  families.   Fifth,  families 
may  believe  that  the  selling  price  of  houses  will  rise 
faster  than  the  general  price  index.   This  may  encourage 
them  to  buy  a  house  larger  than  their  consumption  needs. 
Tlieir  dollar  expenditures  on  housing  would  now  reflect 
investment  as  well  as  consumption  decisions. 


CHAPl^R  IV 
TWO  STAGE  ESTIMATION 

1 .   Estimation  Problems     «  .  , 

Because  of  the  assumption  of  correlated  disturbances, 
estimation  of  equations  (5)  and  (6)  in'  Chapter  III  with 
ordinary  least  '  squares  is  biased  and  inconsistent.   As  in 
the  Tobit  model,  this  bias  and  inconsistency  is  due  to  the 
presence  of  a  disturbance  term  whose  expectation  is. nonzero 
and  not  constant  for  all  observations.   In  the  econometric 
literature  the  expectation  of  this  disturbance  term  is 
often  referred  to  as  a  "Missing  Variable."   More  specifi- 
cally, it  can  be  shov/n  that 

E(U,  |z'y  >   U  )  7^  0,  E(Ut  I  Z  y  <  U  )  7^  0 
1  n '  n   —    n   '        2  n  '  n '     n 

and  also  that  the  error  terms  U   ,  u^   v;ill  be  correlated  with 

In    2  n 

their  regressors.   Thus,  the  direct  least  squares  method  can- 
not be  applied  and  other  consistent  estimation  methods  must 
be  used.   While  one  possible  procedure  is  maximum  likelihood 
estimation,  Lee  (7  6)  has  proposed  tv/o  much  simpler  computa- 
tional "two  stage"  methods.   These  procedures  can  be  applied 
to  equations  5-7  of  Chapter  III  as  follov;s. 

Lot  the  observed  sample  separation  be  denoted  by  a 

dichotomous  variable  I  .   That  is, 

n 

(1)   1=1  iff  Z  Y  >  U 
n  n   —   n 

=  0  iff  Z  Y  <  U  . 
n      n 
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If  v/G  assume  U   is  normally  distributed  the  above  can  be 
written  as  the  follov;ing  model  '  ■ 

(2)  I    -  Z  Y  -  U 

n      n      n 

but  we  only  observe""  I  ,  where: 

(3)  I   =liffl*>0 

n  .   n   — 

=  0  iff  I  *  <  0.  - 
n 

Finally,    the   Probit   model   yt^ilds    the    relationship 

( 4 )  I      =    F  ( Z    Y )    +    £     ,      '-»  % 

n  n  n '  * 

where  F  is  the  cumulative  distribution  of  a  standard  normal 


variate.  ^ 

The  first  procedure  proposed  by  Lee  (1976)  uses  the  com- 
bined owner  and  renter  sample  and  estimates  6-,  and  6^  together, 
To  derive  this  procedure  let  Y   be  the  observed  sample  of  the 
endogenous  variable  for  the  n    family.   Combining  the  two 
separate  equations  into  one  single  equation,  we  have 

Y=IY-,+(l-I)\'' 
n     n  In  n    2n 

(5)  =  I  X-,  p,  +  (1  -  I  )X„  Bt  +  I  U,   +  (l-I  )Ut  . 

n  In  1  n   2n  2     n  In        n   2n 

Although  (5)  looks  like  a  single  regression  equation,  ordinary 

least  squares  cannot  be  applied  directly  to  it.   First,  since 

U   of  equation  (2)  is  correlated  with  U,   and  U^"  the  error 
n      -^  In       2n     • 

terms  do  not  have  zero  mean.   In  fact,  i-je    have 

(6)  Ed^U^,^)  =  p^a^/'"\l^f(UjdU^ 

—  no 

Z  Y 

(7)  E(l-I  )Ut   =  -P^o^f    "  U  f(U  )dU 

n   2n     '22      n    n    n   , 


wh 


ere  n,,  o^,  p,,  p„  denote  the  standard  deviations  of  U,  , 


U^   and  the  correlation  coefficients  between  U,   and  II   and 
2  n  Inn 

betv^/ecn  U„   and  U   respectively.   Also,  f  (U  )  is  the  standard 

2n       n     '^        -'  n ■ — 

normal  density  function.   Thus,  the  mean  error  term  of  (5)  is 
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(8)  E[I,^l-^^  4-  (1  -  IJU^,^]  =  (p^o^-p^o^)/  "  U^f(U^)dU^ 

—  CO 

Furthermore,  the  error  terms  of  the  equation  are  also 

correlated  with  the  regressor's  -since 

Z  Y 

(9)  EtI  U,  I  X,  ]  =  El  I  U,  ]Xt   =  Pia,X,  /  "  U  f(LI  )d   7^  0, 

n  In  n  In      "^  n  In   In     1  1  In      n    n   n  ^ 

—00 

and 

(10)  E[(l-I  )Ut  (l-I  )X„  ]  =  E[(l-I  )U_  ]X_ 

n   2n     n  ^2n  n   2n   2n 

Z  Y  "   -  - 

=  -p„0-.X^  /  "  U  f  (tl  )dU   ^  0; 
•^222  n      n^nn'^ 

—  no 

In  order  to  estimate  (5)  we  must  first  adjust  the  mean  of  the 
error  terms  to  zero  and  take  care  of  ttje  correlation  between 
the  error  terms  and  the  regressors.   We  now  have 

(11)  Y,^  =  F(Z^^Y)X^^B^  -f  (1-F(Z^^Y))X2^62 

Z  Y 

+  {p-,0,     -p^o^)f    "  U  f(U  )dU   +  W  , 
112-2       n    n    n     n' 

—  00 

7.  Y 

where  W   =  I  u -,   +  (l-i  )]j^      -    (p,a,  -p„a_)/  "  U  f(U  )  dU 
n     n  In        n   2n    ^11   '2  2       n    n    n 

—  CO 

+   (X,   P,   -  Xt   Bo)  E   . 

In  1     2n  2   n 

It  can  be  easily  shov;n  that  E  (VJ  )  =  0  and  W   is  uncorrelated 

-^  n  n 

with  the  regressors. 

Hov/ever,  equation  (11)  is  non-linear  in  the  parameters 
Yf  B-,  and  ^^^       ^^^    since  sample  separation  information  is 
available,  a  simple  technique  to  estimate  the  unknown  co- 
efficients 3-,  and  Bn  is  possible. 

First,  equation  (4)  can  be  estimated  by  the  usual  Probit 
analysis.   This  is  the  first  stage  of  the  estimation  procedure. 
The  second  stage  is  to  estimate  the  following  equation  by 
ordinary  least  squares, 

(12)  Y^^  =  F(Z,^9)X^,^6,  +  (1-F(Z^;))X2„B2 

+  (p-^o^  -P^a^)f    "  U^,f(UjdU^  +  W^ 


—00 
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Since  y  is  a  consistent  estimate  of  y  ,  W   v/ill  have  an 
'  '    n   ■ 

asymptotically  zero  mean  and  be  asymptotically  unrelated 

with  the  reqressors.   Even  thoUqh  W   does  not  have  the 

same  variance  for  each  n,  the  estimated  B-i  and  3^  by  ordinary 

least  squares  will  be  consistent. 

To  see  how  B -,    and  B„  are  identified,  equation  12  can  be 

rewritten  as  .w 

(12a)   Y   =  X,  B,  +  X„  6t  +  aX-,   ■+  VJ  , 
n     In  1     2n  2      3n     n  . 

where  X,    =F(Zy)X,  ,  X^   =F{-ZY)X^  ,  ra=  (p. a,    -p„o„) 
In   „     n'   In'   2n        n''  2n'       '1  1    ^2  2 

*      Z  Y  -      . 

and  X-,   =  /    U  f(U  )dU  .   When  all  the  independent  variables 
3n         n    n    n  ^ 

in  X,  and  X„  are  different,  OLS  on  (12a)  yields  estimates  of 
6-,,  6^,  Var  ( B -,  )  and  Var  (B„)  directly.   However,  if  X,   = 
X^  ,  OLS  on  (12a)  will  only  give  estimates  of  B^,  (B-,  -  By)r 
Var  (B-,)  and  Var  (B,  -  B^).   This  is  because  (12a)  now 


reduces  to 


*  *     % 


(12b)   Y   =  X  Bn  +    X  (B,  -  Bo)  +   X,   +  W  , 
n     n  2     n   1     2       3n     n 

*        -^  1 

where  X   =  X -,   =  X^   and  X   =  F(Z  y)X  .   Estimates   of  Var 
n     In     2  n       n       n    n 

(6-,)  canbe  obtained  by  using  OLS  on  the  following 

(12c)   Y   =  X  B,  +  X   (Bn  -  Bi)  +  aX-,   +  W  , 
n     nl     n2     1       3n     n' 

where  X    =  F (-Z  y)X  .   Since  the  three  sets  of  independeht 

n         n '   n  ^ 

variables  in  equations  12b  and  12c  will  be  highly  correlated, 
estimation  of  B-,  and  B-j  v/ill  have  multicollinearity  problems. 

In  order  to  avoid  these  multicollinearity  problems  a 
second  tv.'o  stage  technique  is  available.   This  method  esti- 
mates Bn  using  only  the  owner  data  and  B-,  using  only  the 
renter  data.   That  is,  consistent  estimates  of  B-.  are 


The  standard  errors  will  bo  slightly  biased.   This  is  because 
unbiased  estimates  of  a,  and  a^  require  a  somewhat  different 
formulation  than  the  one  used  in  OLS. 
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obtained  by  using  OLS  on 


(13)   Y^^^  =  X^^Bi  +  a 


In^-'l  ■  '^'liJ  F(Z  y)  ^  ^  ^^In 


Similarly,  consistent  estimates  of  B-j  can  be  obtained  by 

using  OLS  on 

UZy) 


^ 


f^' 


In  the  two  above  expressions  a-.       and  o -.      are  the  covari- 

'^   ,         lu       2u 

ances  between  U,   and  U  ;  and  U^   and  U   respectively.   V, 

l)-i       n    <•    2n       n     ^        -^     In 

and  V_   are  stochastic  disturbance. 
2n 

Equation  (13)  is  derived  by  taking  the  expectation  of 

Y,   conditional  on  I   =1  and  adjusting  the  conditional  mean 
In  n  -^  ,-' 

of  the  error  term  to  zero.   That  is,  •   , 

(15)   E(Y,  ll   =  1)  =  X,  Bi  +  E  (U,  ll   =  1) 
In '  n  In  1       In  '  n 

=  X,  6t  +  /   U,  g(U,   I   =  l)dU,  , 
In  1        In^   In  '  n        In ' 

where  g(U-,  I  I   =1)  denotes  tlie  conditional  density  of  U, 

-^   In'  n  -'      In  ^ 

conditioned  on  I   =1.   Before  proceeding,  rewrite  q(U,  I  I   =  1) 

n  ^         -^ '  -   In '  n 

as  the  joint  density  divided  by  the  marginal  density, 

1) 


16)   g(u^Ji^  =  i; 


^   In    n 


g(i^  =  1) 

Z  Y 

/  "  f  (U,  ,  U  )dU 
-^ In    n    n 

Z  Y 

/  "  f  (U  )dU 
n    n 


,     Z  Y 

-ri — r/  "  f(UT  ,  u  )du  , 
F(Zj^Y)„„      In'   n'   n' 


Z„Y 


n 


F(Z^  ) 


f(U,   U  )f(U  )dU  , 
In '  n     n    n ' 


where  f  (U-|^^,Uj^)  and  f  (U  ^^^J  U^^)  are  the  joint  and  conditional 
bivariate  normal  density  functions,  respectively.   By 


substituting  (16)  into  (15),  it  follows  that 


29 


(17)   E(Y^Jl^  =  1)  =  X.,,^3, 

+  FUTT  ^'"'(C^ln^^"lnl"n^^"ln^f-(^V^^"n 


n 


-oT- 


By  substituting  in  for  f  (U,  lu  )  when  a  2  =  1  and 

^                                -"  In  '  n         u 

integrating,  we  get 

(18)   E(Yt  I  I   =  1)  =  X,  6i     ^ 

In '  n  .JLn  1 


n    -« 

""   Z  Y 

/  "  U  f (U  )dU 
-«>    n    n    n 


=  X,  (?,  +  o. 
In  1     lu 


F  (Z  Y) 
n  ' 


X,  6i  +  a. 
In  1     lu 


-f(Z,^Y) 


F(Z^Y) 


Finally,  to  use  OLS  on  Chapter  Ill's  equation  (8)  directly, 
we  need  to  adjust  the  conditional  mean  of  the  error  term 
to  zero.   Doing  this  we  get 


<i^'^)  ^m  =  ^mP'i  -^  ^lu 


F(Z^Y) 


+  U,   -  o. 
In     lu 


-f  (Z  y) 
n  ' 

F  (Z  y) 
n  ' 


=  HnH   '-    "lu 


-f (Z  Y) 
n 

F  (Z  Y) 
n  ' 


+  V 


In 


where  V 


In 


1  n     ]  u 


-f(Z,^Y) 


F(Z,^Y) 


and  E(V-,  I  I   =  1)  =  0 
1  IT '  n 


Similarly,  Equation  (14)  is  derived  by  taking  the 
expectation  of  Y^  conditioned  on  I^^  =  0  and  adjusting 
the  conditional  mean  of  the  error  term  to  zero. 
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As  mentioned  earlier,  tliis  two  stage  approach  avoids 

the  raulticollineari ty  problems  that  arise  when  X,^  =  X^^. 

-'    ^  In    ■  zn 

Since  nine  of  the  ten  variables  in  X,   and  X^   are  the 

In       lx\ 

same  for  the  model  specif. 164  m  Chapter  III,  P,  and  62 
were  estimated  from  equations  13  and  14  respectively. 


2.   Discussion  of  the  Results 


The  model  was  estimated^ using  1971  data.   The  Probit 
estimates  of  the  choice  equation  are  reported  in  Table  1. 
The  OLS  and  two  stage  estimates  are  reported  and  compared 
in  Tables  2  and  3.   A  discussion  of  the  results  found  in 
these  tables  follo\/s. 

Table  1  contains  the  Probit  results  with  eleven  in- 
dependent variables.   Both  age  dummies  are  significant 
and  indicate  that  families  with  a  head  over  64  are  the 
most  likely  to  own  their  ov/n  home  and  as  expected,  families 
headed  by  an  individual  under  36  are  the  least  likely  to 
own.   The  coefficient  for  the  Black  dummy  is  highly  signi- 
ficant and  negative,  indicating  that  Blacks  are  more  likely 
to  be  renters  than  owners.   This  could  be  due  to  either 
price  discrimination  or  merely  a  stronger  preference  for 
those  housing  goods  that  are  cheaper  to  consume  by  renting. 
A  similar  interpretation  applies  to  the  negative  sign  for 
the  female  dummy. 

The  negative  coefficient  for  the  mover  variable  sug- 
gests that  transactions  costs  of  buying  and  selling  a  house 
effectively  raise  the  price  of  a  home  for  the  family  that 
frequently  moves.   These  families  are  therefore  more  likely 
to  rent.      •  ■   . 
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TADLp  1 
Probit  Coefficients  for  Owners* 


Intercept 

Age  1  35      ■ 

Age  36-64 

« 

Black 

Female 

Hover 

Log  (City  Size) 

Log  (Distance  from  Center  of  City) 


Log  (Fajnily  Size) 


Log  (Real  Permanent  Income) 


Log  (Relative  Price  of  Renting  Housing) 


Log  (Relative  Price  of  Owning  Home) 


Number  of  Owners 


-4.6586 
(.5350) 

-1. 2001 
(.1094) 

-. 6567 
(.0994) 

-. 3256 
(, 0728) 

-.2910' 
(.0682) 

-.9591 
(. 0721) 

-.1060 
(.0199) 

.1389 
(. 0276) 

.  1328 
(.0498) 

.7586 
(.0535) 

2.1735 
(.4015) 

-.3684 
(. 3793) 

1803 


Number  of  Renters 


1225 


'Standard  errors  in  parentheses 
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TABLE  2  _      . 

Comparison  of  Two  Stage  and  nhS   Estimates  for  Renters* 


Intercept 

Age  f^  3  5 

* 

Age  36-6  4 

Black  ^  . 

Female 

Mover 

Log  (City  Size) 

Log  (Distance  from  Center  of  City)  . 

Log  (Family  Size) 

Log  (Real  Permanent  Income) 

Log  (Relative  Price  of  Renting  Housing) 

Missing  Variable 


r2 


Number  of  Observations 


Two  State 
Estim.ates 

2. 

OLS 

2.0515 

,0965 

-.1994 
(.0990) 

.1774 
.0580) 

-.1320 
(.0730) 

.1190 
.0558) 

-.1823 
(.0494) 

.1776 
,0464) 

.1061 
(.0329) 

.1107 
.0282) 

.  1264 
(.0593) 

.  1407 
.0286) 

.0564 
(.0100) 

.0581 
.0076) 

.0005 
(.0310) 

- 

.  0017 
.0299) 

.0882 
( .0215) 

.0863 
.0204) 

.  5031 
(.0506) 

.4908 
. 0236) 

-1.2872 
(.2265) 

-1 

.3216 
.1885) 

-.0329 
(.1198) 



.4365 

.4  369 

1225 

1225 

*Standard  errors  in  parentheses 
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TABLT  3 

Comparison  of  Two  Stage  and  OLS  Estimates  for  Owners* 

Two  Stage 
-  "^      '  Estimates     OLS 

Intercept  -    .  2.0392  2.0909 

Age  <  35  '  .   '  '    -.1454  -.1343 

(.0693)  (.0407) 

Age  36-64  '  .    '  "         -.1563  -.1505 

^  (.0440)  (.0330) 

Black  ■       -.2534  -.2502 

(.0585)  (.0561) 

Female  "        .0772  .0006 

(.0352)  (.0308) 

Mover  •   -.1018  -.0909 

(.0660)  (.0367) 

Log  (City  Size)  .0282  .0295 

(.0088)  (.0060) 

Log  (Distance  from  the  Center  of  City)         .0193  .0182 

(.0363)  (.0357) 

Log  (Family  Size)  ,0298      .0285 

(.0234)  (.0224) 

Log  (Real  Permanent  Income)  "        .5712      .5644 

(.0396)  (.0206) 

Log  (Relative  Price  of  Owning  Home)  .1784  .1768 

(.1914)  (.1912) 

Missing  Variables  -.0195       

(.0979) 

r2  .4477  . 4490 

Number  of  Observations  1803       1803 


*Standard  errors  in  parentheses 
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The  next  two  variables,  city  size  and  distance  from  the 
center  of  the  city,  indicate  that  in  rural  ar.eas  the  family 
is  more  likely  to  own  their  own.  home.   This  could  be  caused 
by  higher  land  values  in  th"fe  more  densely  populated  areas. 
In  populated  areas  where  land  is  scarce  -  and  therefore 
expensive  relative  to  land  in  rural  areas  -  one  would  expect 
to  find  more  people  living  in  land  ecionomizing  apartments. 
Conversely,  in  rural  areas  one  would  expect  to  find  a  rela- 
tively limited  number  of  people  living  in  apartments.   For 
example,  few  families  could  afford  to  Guy  land  and  build 
single  family  houses  in  downtown  New  York  City.   Instead  of 
houses  we  see  high  rise  apartments' that  economize  on  the 
scarce  input  land.   Based  on  mere  availability  then,  it 
follows  tliat  city  size  should  have  a  negative  sign  and 
distance  from  the  city  a  positive  sign  in  the  choice  equation. 

The  coefficient  for  family  size  is  positive.   This 
indicates  that  larger  families  prefer  the  more  spacious 
living  conditions  provided  by  home  ownership.   The  positive 
coefficient  for  the  income  variable  suggests  that  lower 
income  families  are  unable  to  obtain  a  mortgage  for  the 
size  of  house  they  desire  and  therefore  rent  rather  than      ^ 
own.   Finally,  the  two  relative  price  variables  have  reason- 
able signs.   As  the  price  of  renting  goes  up  or  the  price  of 
owning  goes  down,  a  family  is  more  likely  to^  own  their  own 
home . 

Tables  2  and  3  contain  the  two  stage  estimates  of  the 
demand  equations  (equations  13  and  14).   For  comparison  pur- 
poses these  tables  also  contain  OLS  estimates  of  the  renter 
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and  owner  demand  equations.   These  OLS  estimates  were  ob- 
tained by  splitting  the  sample  into  tv/o  groups,  renters 
and  owners.   Demand  equations  v;ere  separately  estimated 
for  each  group.   A  discussio"~n  of  the  two  stage  estimates 
follows. 

The  coefficients  for  the  -age  dummies  are  negative 
and  significant  for  both  ovm'ers  and  ];-enters.   For  ov/ners 
the  coefficients  give  a  U-shnped  pattern  of  housing  ex- 
penditures versus  age  of  head  at  given  income  levels. 
For  renters,  expenditures  on  housing  increase  as  age  in- 
creases.  The  coefficients  for  owners  are  consistent  with  a 
theoretical  result  derived  in  Muth..  (74).   Using  what  he 
calls  the  "income  effect,  price  effect  and  length  of  stay 
effect,"  Muth  presents  a  justification  for  this  U-shaped 
pattern. 

The  black  and  female  dummies  indicate  that  blacks 
spend  less  on  housing  than  nonblacks,  while  females  spend 
more  on  housing  than  males.   While  this  may  suggest  the 
existence  of  some  discrimination  against  blacks  in  1971, 
it  does  not  prove  it.   In  general  the  outcome  for  both 
blacks  and  females  largely  depends  on  tlieir  preferences 
relative  to  others.   Ceteris  paribus,  a  stronger  (weaker) 
preference  for  housing  relative  to  all  other  goods  v/ould 
result  in  more  (less)  housing  expenditures. 

The  coefficient  for  the  mover  variable  in  the  renter 
group  is  positive.   Therefore,  moving  renters  spend  more 
than  nonmovers.   Mobile  families  may  spend  more  for  rental 
units  because  of  search  costs.   That  is,  since  they  only 
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spend  a  relatively  short  time  in  each  location,  the  marginal  ■ 
benefit  from  an  additional  sf^arch  -  in  terms  of  a  lower  price 
does  not  outweigh  the  marginal  -costs.   For  the  nonmoving 
renter  the  optimal  searching"  rule  -  in  terms  of  the  number 
of  searches  or  stopping  priqe  -  probably  results  in  a  rela- 
tively lower  expenditure.   For  a  review  of  search  theory  see 
McCall  and  Lippman  (75) .   In  the  owner  group  the  mover  co- 
efficient  was  negative.   This  may  be  due  to  three  factors. 
First,  transaction  costs  are  lower  for  less  expensive  houses 
since  realtor  fees  and  the  like  are  a  percentage  of  the  cost. 
Mobile  families  can  economize  on  these  costs  by  buying  less 
expensive  dv/ellings.   Second,  it  may  be  easier  to  resell  a 
lower  priced  house.   Third,  mobile  families  have  little 
incentive  to  "buy  that  dream  house"  and  maintain  it.   Rather, 
they  are  more  likely  to  settle  for  a  simple  place  to  live. 
The  positive  coefficients  for  the  city  size  variables 
indicate  that  both  owners  and  renters  spend  more  on  housing 
as  city  size  increases.   Since  tlie  dependent  variable  is  real 
dollar  expenditures,  the  expected  a  priori  result  depends  on 
the  various  price  elasticities  of  housing  goods.   Since  the 
demand  for  "housing  services"  are  generally  viewed  as  price 
inelastic,  then  the  expected  effect  on  expenditures  would  be 
positive.   That  is,  a  larger  city  implies  higher  land  values 
which  in  turn  imply  higher  housing  prices.   Given  inelastic 
demand,  higher  prices  would  lead  to  more  expenditures  on 
housing.   Also,  if  we  assume  that  all  families  require  some 
minimum  size  house,  and  therefore  some  minimum  size  lot, 
another  explanation  is  possible.  .Once  this  minimum  size  is 
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reached  families  can  no  loncfor  economize  on  lot  size  by 
buying  smaller  lots.   Higher  land  values  therefore  imply 
higher  expenditures  once  this  lot  size  constraint  is 
effective.   The  two  stage  estimates  of  the  distance  co- 
efficients are  positive  for  both  owners  and  renters  but 

are  never  significant.    ■     "      • 

« 

The  coefficients  for  family  size  show  that  as  family 

■w 

size  increases  renters  spend'more  oA  housing.  For  owners 
however,  family  size  has  no  significant  effect  on  housing 
expenditures.  ■* 

As  expected,  if  income  increases  both  owners  and 
renters  spend  more  on  housing.   These  coefficients  are  well 
below  one  on  similar  to  those  found  in  Lee  (68) ,  Carliner 
(73)  and  Fenton  (74)  .  ... 

The  coefficient  on  the  relative  price  variable  is 
negative  for  renters  but  positive  for  owners.   Given  all 
the  problems  of  nonhomogeneity  and  cautions  about  the 
price  indicies  discussed  in  Chapter  III,  these  results 
are  hard  to  interpret.   However,  the  -1.2872  result  for 
renters  is  similar  to  Fenton's  (-1.27,  -1.35  and  -1.09) 
for  all  renter  households,  high  and  lov;er  renter  households, 
respectively).   Also,  the  positive  coefficient  for  owners 
is  not  significant. 

The  coefficients  for  the  "Missing  Variable"  in  the 
renter  and  owner  equations  are  estimates  of  the  covariances 
a,   and  o   ,  respectively.   The  negative  sign,  although  in- 
significant, indicates  the  renters  spend  less  as  renters 
than  they  would  spend  if  they  owned.   Similarly,  owners 
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spend  less  ns  owners  than  tjiey  would  spend  if  they  v/ere 
forced  to  rent. 

Finally,  althougli  the  two  stage  estimates  are  theore- 
tically  better  than  the  OLS-. estimates  >  both  sets  of  results 
are  similar.   The  reason  for  this  similarity  is  the  in- 
significance of  the  Missing  Variables.   This  implies  that 

o,   and  a,,   are  not  significantly  different  from  zero. 
lu       ^u  ^  -' 

■w 

Without  using  the  two  stage. 'method  l^owever ,  this  zero 
correlation  conclusion  could  not  have  been  dravm. 


'  CHAfT^R  V 
MAXIMUM  LIKELIHOOD  ESTIMATION 

Introduction 

— » 

In  Chapters  III  and  IV  .^  housing  expenditure  model  with 
interdependent  choices  between  owning  or  renting  was  studied. 
Because  of  this  interdependence  the  binary  own  or  rent  choice 
functioii  is  determined  simultaneously  with  the  housing'  expen- 
diture equations.   It  is  the  simultaneity  which  makes  that 
model  an  extension  of  past  studies  on  housing.   While  others 
such  as  Poirier  (74)  and  Quigley  (76)  have  presented  models 
of  consumer  choice,  they  did  not  include  an  analysis  of  con- 
sumer expenditure  equations.   Similarly  Lee  (68),  Carliner 
(73),  Fcnton  (74)  and  Polinsky  (75)  have  studied  and  esti- 
mated housing  expenditure  equations  without  considering  the 
implied  choice  equation.   To  the  casual  observer  these  over- 
sights are  minor.   A  closer  examination,  hov/ever,  showed 
that  estimation  of  the  expenditure  equations  is  not  straight- 
forward.  In  general  simple  ordinary  least  squares  (OLS)  on 
the  expenditure  equations  will  lead  to  biased  estimated. 
Other  unbiased  estimation  techniques  have  to  be  used.   One 

4 

sucli  procedure  is  a  two  stage  method  proposed  by  Lee  (76). 
This  is  the  procedure  used  in  Chapter  IV  to  estimate  the 
model.   Hov/ever,  the  estimates  of  the  correlation  between 
the  disturbances  in  the  expenditure  equations  and  the  dis- 
turbance in  tlie  decision  function  were  not  significantly 
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different  from  zero.  This  is  an  important  result  since  it 
implies  that  simultaneity  does  not  exist  and  therefore  OLS 
estimates  of  the  expenditure  equations  are  unbiased. 

In  tliis  chapter  a  furtl^er  test  of  the  simultaneity 
issue  is  made.   To  do  this  I  start  with  the  two  stage 
estimates  as  initial  consistent  estimates  and  then  apply 
a  two  step  maximum  likelihood  procedure  (2SML)  to.  give  the 
final  estimates.   Since  the  ■2SML  apj?roach  does  not  ignore 
the  simultaneous  effects  and  the  usual  OLS  approach  does 
ignore  these  effects,  a  maximum  likelihood  ratio  test  is 
employed  to  compare  the  two  methods.   If  simultaneity  does 
exist  then  this  test  should  show  the  2SML  estimates  to  be 
significantly  better  than  the  OLS  estimates. 

1.   Amemiya ' s  Two  Step  Maximum  Likelihood  Procedure 

Tlie  model  presented  here  is  the  same  as  the  one  in 
Chapter  III  except,  for  tlie  specification  of  the  prices. 
In  Ciiapter  III  two  sets  of  indices  were  used,  one  for 
owning  and  one  for  renting.   This  chapter  only  uses  one 
set  of  aggregate  price  indices  for  both  owners  and  renters. 
A  different  specification  was  employed  because  the  positive 
coefficient  for  the  price  variable  in  the  owner  equation 
obtained  in  Chapter  IV  is  liard  to  justify.   Each  household 
has  associated  with  it  a  housing  price  index  and  a  price 
index  for  all  goods.   The  purpose  of  these  indices  is  to 
standardize  housing  expenditures  and  income  for  different 
cost  of  housing  and  cost  of  living  across  the  U.S.,  res- 
pectively.  Given  this  price  specification,  the  relative 
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price  of  housing  services  would  not  enter  into  the  rent-own 
decision  function.   It  would,  of  course,  influence  the 
allocation  of  the  budget  between  housing  and  all  other 
goods.   More  specifically,  ^he   model  studied  has  the  follov;- 
ing  specification 


t 


(1^  ^m  =  '^m^i  ^   ^^n 

(3)   I*   =  Z  Y  -  t-  r   ''      .  "^ 

n      n '     n 

where  C-,   are  annual  expenditures  on  housing  if  the  family 
owns,  C^   are  annual  expenditures  on  housing  if  the  family 

rents  and  (g,  ,  e .,  ,  e  )  are  trivariate  normally  distributed 

1  n    2  n    n  ^ 

N  (0,  Q) .       Here  Q  is  a  3  x  3  covariance  matrix.   Without 

loss  of  generality  the  variance  of  e  is  assumed  to  be  one. 

I   is  an  unobservable  index.   Also,  assume  C,   and  C^   are 
n  In       2n 

mutually  ejcclusive  and  cannot  be  observed  simultaneously 
for  any  one  individual.   Wliat  is  observed  are  exogenous 

variables  X,  ,  X^  ,  Z  ,  the  binary  irtdex  I   and  expenditures 

In    2n'   n  ^  n        '■ 

on  housing,  C  ,  such  that 
^    n 

{A)       C   =  C,  ;  I   =1  iff  I  >  0, 

n     In    n  —      ■ 

(5)   C   =  C^  ;  I   =0  otherwise, 
n     2n    n 

Tlie  price  indices  were  constructed  from  table  128  in 
the  Bureau  of  Labor  Statistics  (BLS)  Handbook  of  Labor 
Statisbics  1972  -  "Consumer  Price  Index,  23  Cities  or 
Standard  Metropolitan  Statistical  Areas,  All  Items  and 
Major   Groups,  197^1-1971."   This  table  gives  six  price 
indices  -  including  a  housing  index  -  with  a  base  of  1967 
=  100.   Twenty-one  of  these  SMSAS  (Honolulu  and  Washington, 
D.C.  were  excluded)  v/ere  grouped  into  one  of  four  regions: 
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Northeast,  Nortli  Central,  South  and  West.   Two  average 
price  indices  and  a  relative  price  index  v/ere  then  cal- 
culated for  each  of  the  four  regions  and  assigned  to  the 
appropriate  family.   The  rast  of  the  data  are  from  A  Panel 
Study  of  Income  Dynamics  (72)  on  a  sample  of  3,02  8  families. 

In  the  expenditure  equations  the  dependent  variable 
is  logarithm  of  housing  expendituresv divided  by  the  regional 
price  of  housing  index.   The'  same  t^n  explanatory  variables 
are  used  in  the  two  expenditure  equations.   They  include 
the  personal  characteristics  of  the  family  head  (age,  race, 
sex);  family  background  (mover,  logarithm  of  family  size); 
logarithm  of  family  "permanent  income";  regional  variables 
(logarithm  of  city  size  and  logarithm  of  distance  from  .the 
center  of  the  city)  and  a  relative  price  index  of  housing. 
All  these  explanatory  variables  except  the  relative  price 
of  housing  are  included  in  the  decision  function.   For  a 
more  detailed  description  of  these  variables  and  their 
expected  effects  see  Chapter  III. 

The  model  can  be  rewritten  as  the  following  switching 
regression  model 

(6)  C   =  X,  Bt  +  e,   iff  Z  Y  >  e 

n     In  1     In       n'  —   n 

(7)  C   =  X„  Bt  +  Et   iff  Z  Y  <  e  . 

n     2  n  2     2  n       n '     n 

Assume  that  c,  ,  r^   and  e      are  trivariate  normally  dis- 
1  n    2  n       n  -' 

tributed  with  zero  mean  and  covariance  matrix  $7  such  that 

2 


n  = 


a 


o 


21 


le 


12 

2 
'2 

'2e 


a 


le 


2c 
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As  noted  by  Lee  (76),  all  th?  parameters  in  the  above  model 
are  identifiable  except  a^^  "  cov(r.^,e2).   The  maximum  like- 
lihood procedure  described  in  Amemiya  (74)  is  used  to  esti- 
mate this  model.   Following""'Amemiya '  s  proof  it  can  be  easily 
shown  tliat  these  estimates  are  asymptotically  efficient. 

Let  f-^    and  f^    be  the  iointly  normal  distributions  of 
(c^,e)  and  {£2,^)     respectively.   Dropping  the  n  subscripts 
on  C,  X^,  X^'    2  and  I,  the  Tikeliho^^d  function  for  this 

model  is 

2    2 

(8)   L(3^  ,  B^'Y.Oj^  ,0^    '^i£'<^2c^   - 

T    '  Zy  j       ■ 

n=l   -00  Zy        '^ 


n  =  1  -  00  7  Y 


v;here : 


g^(C-X^6^)  =  —^ exp  {-  — i^  (C-X  B,)^), 

^2  ^^"'^2^2^  = ^ exp  { ^  {C-X2B2)^}' 

/27  02       202 


fi(clc-x^p.^)  = 


^27  /:        2 


^"^^  ^-  TTT-'-Tr  t^  -  Pic  ^  (c-x,B,)]'}, 

2(l-p^^  )  1 
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f^  (c IC-X2B2) 


exp  {- 


2(1-P2^  ) 


v'T"  /;       2 

27  f^-  -  ^2e.k    ^^-'"'2^^^'^' 


where  p.,  and  p_  are  the  correlation  coefficients  of  (E,,e) 
and  (p^,c)  respectively. 

The  integrals  in  equation  8  can  be  simplified  bv  trans- 
forminq  the  variables.   Following  Ffeund  (62),-  if 

F(Zy)  =  /   f  (e)dG, 

then  given  a  new  variable  t  such  that  - 

t  =  h(c)  ;  z    =    h~^(t)  ,       ■   .     ^    . 
where  h(e)  is  increasing,  one  can  -transform  variables  and 

obtain 

^^^^'         dh"^(t) 
.G[h(E.)]  =  /      f[H(t)]    dt     '^^- 
—00 

By  selecting  h(c)  as 


/ 


s  -  ^  (C-X^B,)] 


1-p 


l€ 


_t 

e    dt , 


and  solving  for  G[h(e)],  the  first  integral  in  equation 
simplifies  to 

h(ZY) 

/ 

-00     /2  7r 

A  similar  analysis  holds  for  the  second  integral  in  equation 
8.   Finally,  after  making  these  two  transformations  and 
taking  logarithms  one  gets 


45 


(9)   L   =  In  LiB^.B^f'y  ,c^,o^,p^^,p^^) 


T 

);  (  I  [In  (g  (C-X  6,  )  )  "+ 
t=l  .    ■  ^  ^ 


"l 
In  (/   >  (t)dt)  ]  + 


(l-I)  [ln(g2(C-X262))  +  In  (^-  cf)  ( t)  dt).  ]  } 
v;here  '^(t)  is  the  standard  normal  density  function,  . 

^^l^'^'l'^'^^l'^lrj  "  [ZY  -  ^  (C-X^B-^)] 

'     V  T      2      -    1 

and  -         ■ 

^'2  ^^2'^'^2'^2e^  = [Zy  -  ^  (C-X2B2)]. 

/7^    2    *     2 
^"P2e 

In  order  to  calculate  the  maximum  likelihood  estimates, 
a  numerical  optimization  algorithm  must  be  used.   Since  the 
likeli'nood  function  derived  above  is  nonlinear  these  com- 
putations v/ill  be  quite  complicated.   However,  a  simplified 
procedure  discussed  in  Amemiya  (74)  is  available  if  con- 
sistent estimates  of  p,  , Bt , Y ^ o,  ,  o^  ,  p,   and  p„   can  be 

1   2'''  1'  2"^1p:      ^  2e 

obtained.   Let  0  be  these  consistent  estimates.   Amemiya' s 

% . 
"two  step  maximum  likelihood  estimates"  (2SML)  ,  0..,  are 

calculated  from  .  . 

(10)   e   -  0  -  [^^l'^L(0)  j~^  91nL(0) 


M        '    9080'   '      90     • 
The  first  step  of  Amemiya 's  procedure  is  obtaining  the  con- 
sis  tent  estimates  0.   These  can  be  found  using  a  two  stage 
method  described  in  Lee  (76).   Lee's  procedure  uses  probit 
analysis  in  the  first  stage  and  OLS  in  the  second.   The 
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details    of    tliis    procedure    ar(^    found    in    Chapters    2    and    3 
of    Lee     (76).       The    second    step    of    Arnemiya '  s    procedure    is 


% 


substituting  G  into  (10)  and  solvinq  for  0,.. 

,  M 

2 .   A  Comparison  of  OLS  and  Maximum  Likelihood  Estimates 
and  a  Test  for  Simultaneity 

Tables  1  and  2  present  the  probit  and  two  step  maximum 

« 

likelihood  (2SML)  estimates  of  the  decision  function  res- 
pectively.  Both  estimation 'procedulres  yield  similar  results, 
but  the  2SML  estimates  have  slightly  smaller  standard  errors 
for  all  the  estimated  parameters.   All' the  coefficients  are 
significant  and  have  the  expected  signs  discussed  in  Chapter 
III.   This  implies  that  all  the  included  variables  have  a 
significant  impact  on  the  buy  versus  rent  decision. 

Table  3  presents  the  two  stage  and  2SML  estimates  of 
the  expenditure  equations  for  owners.   Table  4  contains 
similar  estimates  for  renters.   As  expected  the  asympotically 
efficient  2SML  estimates  generally  -have  much  smaller  stand- 
ard errors  than  the  two  stage  estimates.   However,  the  two 
sets  of  estimates  are  quite  similar  with  three  exceptions. 
First,  the  mover  coefficient  in  the  owner  equation  changes 
from  the  two  stage  estimate  of  -.04407  to  the  2SML  estimate 
of  -.166199  and  becomes  significant.   Second,  the  city  size 
coefficient  in  the  owner  equation  decreases  from  the  two 
stage  estimate  of  .06592  to  the  2SML  estimate  of  .04237. 
In  terms  of  the  2SML  standard  errors,  these  represent  more 
than  a  three  standard  error  change.   Third,  the  distance 
variable  in  the  renter  equation  changes  from  a  positive 
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TABLE  1 
Probit  Estimates  of  th'^  Decision  Function 

VARIABLES  ESTirtATES  STANDARD  ERROR 

Constant  -3.26^92  0.47282 

Age  <  35           •  -1.17945  0.10754 

36  <  age  <  64  -0.63^6      ^  0.09768 

Black   '  -0.30715  •■       0.07134 

Female  -0.32102  0.06729 

Mover  -0.94359  '-        0.07093 

In  (City  Size)  -0.14969  '   ''   0.01763 

In  (Distance  from 

center  of  city)  0.12779  0.02721 

In  (Family  Size)  0.13434  0.04929 

In  (Relative 

permanent  income)  0.63437  0.05168 
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TABLE  2 

Two  Step  Maximum  Likelihood  Estimates 
of  the  Decisicn  Function 


VARIABLES  ESTIMATES  STANDARD  ERROR 

Constant  -3.24123  0.47183 

Age  <  35  -1.16230  •  -          0.10681 

36  <  age  <  64  -0.63i'51  "^          0.09711 

Black  -0.30971  0.07076 

Female  -0.31895  '       0.06685 

Mover  -0.94527  .    0.07053 

In  (City  Size)  -0.14991  ^            0.01755 

In  (Distance  from 

centery  of  city)  0.12864  .   .        0.02700 

In  (Family  Size)  0.13381  0.04904 

In  (Real 

permanent  income)  0.68159  0.05155 
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sign  to  a  negative  sign  and  'jc^comes  significant.   Given, 
Chapter  Ill's  discussion  on  land  values  and  location  this 
negative  sign  seems  more  appropriate.   VJith  the  exception 
of  price  in  the  renter  equa*tion,  all  the  2SML  coefficients 
are  significant.   Also,  they  have  the  ej:pected  signs  dis- 
cussed in  Chapter  III. 

Although  past  papers  such  as  Uee  (68) ,  Carliner  (73) , 
Fenton  (74)  and  Polinsky  (75^  specified  slightly  differ- 
ent expenditure  equations,  a  fev;  comparisons  are  possible. 
The  permanent  income  elasticity  estimates  of  .60678  for 
owners  and  .50046  for  renters  are  quire  close  to  Carliner' s 
estimates   of  .631  for  owners  and  \520  for  renters.   Fenton 
estimated  permanent  income  elasticities  of  .41,  .46  and  .46 
for  all  renter  households,  and  high  and  low  income  renter 
households,  respectively.   Lee  (68)  reported  several  sets 
of  income  elasticity  estimates.   His  estimates  of  permenent 
income  elasticity  for  owners  ranged  from  .608  to  .892.   For 
renters,  his  permanent  income  elasticity  estimates  ranged 
from  .387  to  .678.   Polinsky  concludes  that  the  true  incom.e 
elasticity  is  approximately  .75  for  both  renters  and  owners. 
Finally,  the  coefficients  for  the  relative  price  variables 
are  .negative.   These  coefficients  are  not  reliable  estimates 
of  traditional  price  elas tici ty--percentagc  change  in  quantity 
per  percentage  change  in  price.   This  is  true  for  tv.'o  reasons. 


Carliner  uses  the  same  data  base  as  mine.   His  estimates  of 
,631  and  .520  are  based  on  a  regression  using  four  year 
average  income,  relative  price  of  liousing,  and  dummies  for 
Black,  female  and  age  as  explanatory  variables.   Also,  his 
coefficients  for  the  Black,'  female  and  age  dummies  are 
similar  to  my  estimates. 
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First,  the  dependent  variable  is  deflated  dollar  expendi- 
tures on  housing.   The  price  coefficients  are  therefore 
percentage  change  in  deflated  dollar  expenditures  per 
percentage  change  in  price.   Second,  the  poor  quality  of 
the  available  price  data  makes  any  estimate  of  price  elas- 
ticity equally  poor.   This  would  be  the  case  even  if  the 
dependent  variable  were  measured  in  units  of  quantity. 
Tlie  negative  signs  of  the  two  price  coefficients  do  indi- 
cate that  families  living  in  those  regions  v;here  housing 
is  relatively  (to  all  other  goods)  more  expensive  tend  ho 
spend  less  on  housing. 

It  would  be  useful  to  compare  the  2SML  estimates  of 
the  expenditure  equations  with  OLS  estimates.   The  OLS 
estimates  were  obtained  by  dividing  the  sample  into  two  ' 
groups  -  owners  and  renters  -  and  applying  OLS  to  each 
group  separately.   These  results  are  presented  in  Tables 
5  and  6-.  -     ■ 

A  comparison  of  Tables  3  and  5  shows  the  2SML  and 
OLS  estimates  to  be  slightly  different  but  compatible. 
One  exception  to  this  compatibility  is  the  distance  vari- 
able coefficient.  'The  value  obtained  with  2SML  is  five. 

V 

times  greater  than  t,he  OLS  estimate.   Also,  the  2  SML  , 

t 
estimate  is  significant  and  the  OLS  estimate  is  insignifi- 
cant. 

A  comparison  of  Tables  4^ and  6  again  shows  a  difference 
in  the  distance  coefficient.   The  2SML  estimate  is  negative 
and  significant.   The  OLS  estimate  is  positive  and  insignificant, 
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All  the  other  coefficients  are  nearly  the  same. 

While  these  casual  comparisons  do  reveal  some  differ- 
ences in  the  two  approaches,  a  more  detailed  investigation 
on  whether  or  not  simultanaity  occurs  is  possible.   This 
investigation  involves  the  use  of  a  maximum  likelihood  ratio 
test.   Consider  the  null  hypothesis  that  there  is  no  correla- 
tion between  the  disturbances  in  the. housing  expenditure 
equa.tions  and  the  decision  ftinction'^.   That  is,  the  null 
hypothesis  contrains  a    and  a    to  zero.   The  maximum 
likelihood  estimates  of  the  expenditure  equations  now  reduce 
to  simple  OLS  estimates,  while  the  maximum  likelihood  esti- 
mates of  the  decision  function  are-  probit  estimates.   Denote, 
these  estimates  as  0„.   The  alternative  hypothesis  does  not 

constrain  o -,       and  o„   to  zero.'  The  maximum  likelihood  esti- 
le       2f:  ■ 

mates  are  now  the  2SML  estimates  presented  in  Tables  2-4. 

Denote  these  estimates  as  0.,.   Given  -G^  and  0,.  the  ratio  -2 

M  0       M 


In 


L(Oq) 


has  a  Xt  distribution  asymptotically.   For  this 


model  the  ratio  is  5.771  and  significant  at  the  0.05  level. 
This  implies  that  simultaneity  does  occur,  albeit  the  evi- 
dence is  v/eak.   It  then  follows  that  OLS  yields  biased  esti- 
mates of  the  expenditure  equations,  even  if  this  bias  is  not 
very  strong.   Therefore  estimation  procedures  such  as  the 
two  stage  and  2SML  approaches,  which  account  for  the  simul- 
taneity, are  more  appropriate. 
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TADLC  5 
Housing  Expenditure  Eq'.intion  of  the  Owners 


CTLS 
VARIABLES  ESTIAT-ITES       '        STANDARD  ERROR 


Constant       -  1.49477 

Age  <  35  -0. 15345       '  "  0.0413 

36  <  age  <64  -0.16520  0.03  3  59 

Black  -0.26055  0.05709 

Female  0.08828         .  "*  0.03129 

Mover  -0.09916  0.03732 

In  (City  Si-ze)  0.05922      '  0.00572 

In  (Distance  from 

center  of  city)  -0.00563  0.03646 

In  (Family  Size)  0.03752  0.02284 

In  (Relative 

price  of  housing)  -2.62209  1.085 

In  (Real 

permanent  income)  0.53399              '    0.02079 
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TABLE  6 
Housing  Expenditure  Equrition  of  the  Renters 


OtS 
VARIABLES  ESTIMATES  STANDARD  ERROR 


Constant  1.77267 

Acje  ..<  3  5  -0.1842  3  •  .         0.05846 

36  <  age  <  64  -0.12172  "^     ,-     0.05634 

Black  -0. 19701  0.0465 

Female             .   0.11751  _  -.  0.02936 

Mover  0.13  878  "       0.02877 

In  (City  Size)  0.06964     ^  0.00778 

In  (Distance  from 

center  of  city)  0.00698  0.02998 

In  (Family  Size)  0.08309  0.02050 

In  (Relative 

price  of  housing)  -1.81576  1.36963 

In  (Real 

permanent  income)  0.50212  0.02368 

Estimated  standard  error  of  disturbance  o^   =    0.4348 


CHAPTER  VI 
CONCLUSIONS 

The. model  derived  in  chapter  three  of  this  thesis  begins 
with  the  premise  that  "housing  services"  are  not  a  homogeneous 
good.   Rather,  housing  is  viev/ed  as  a  bundle  of  several  commod- 
ities such  as  location,  size  of  house,  type  of  structure, 
neighbors  and  other  humanities.   This- view  of  housing  is  some- 
what different  than  the  Olsen  (69)  approach  where  hpmogenity 
is  assumed  and-  the  King  (76)  approach  v/hich  estimated  demand 
equations  for  Lancastrian  characteristics. 

It  is  then  argued  that  some  bundles  are  cheaper  to  con- 
sume if  the  family  buys  their  own  home.   Conversely,  other 
bundles  may  be  cheaper  to  consume  if  rented.   A  family  is 
assumed  to  consume  that  bundle  which  maximizes  utility. 
Following  this  line  of  thought  a  general  model  of  consumer 
choice  between  owning  and  renting  -  and  how  much  to  spend  - 
is  developed.   The  model  is  a  switching  regression  model 
v;ith  a  choice  function  that  is  not  assumed  to  be  independent 
of  the  expenditure  equations.   This  differs  from  past  studies 
of  housing  where  independence  is  tacitly  assumed  and  simple 
OLS  is  applied  directly  on  the  owner  and  renter  expenditure 
equations.   Given  that  some  simultaneity  is  'present-,  these 
OLS  estimates  will  be  biased.   Other  consistent  estimation, 
procedures  must  be  used.   In  this  thesis  consistent  estimates 


56 


...  57 

of  the  model  arc  found  with  ?i  "two  stage"  technique  and 
a  two  step  iTiaximum  likelihoorl  procedure. 

Both  the  "two  stage"  and  "two  step"  procedures  perforin 
quite  well  in  terms  of  standard  errors  and  explained  vari- 
ation.  For  example,  all  but  one  of  the  coefficients  esti- 
mated by  the  two  step  procedure  are  significant  at  the  5% 
level.   Only  the  estimate  of  price  elasticity  in  the  renter 
equation  is  insignificant.   'Also,  with  the  exception  of  the 
price  elasticity  estimates  the  empirical  results  are  com- 
patible with  economic  theory  and  other ''recent  studies.   The 
two  step  estimates  of  permanent  income  elasticity  of  .60678 
for  owners  and. 50046  for  renters  are  significantly  less  one. 
These  estimates  support  the  results  found  in  Polinsky  (75), 
Fenton  (74),  Carliner  (73)  and  Lee  (69),  but  differ  with 
Muth's  (60)  results.   Muth  (60)  uses  aggregate  time  series 
data  and  concludes  that  the  income  elasticity  for  desired 
housing  stock  is  at  least  1.0.   The  two  steps  estimates  for 
owners  indicate  that  families  living  in  rural  areas,  families 
who  frequently  move,  and  families  headed  by  a  black  tend  to 
consume  smaller  units  of  housing.   Conversely,  large  families 
and  families  headed  by  either  a  female  or  by  someone  over  64 
tend  to  consume  larger  units  of  housing.   The  results  are  sim- 
ilar for  renters  witli  one  exception.   Families  who  frequently 
move  tend  to  consume  larger  units  of  housing.   The  two  step 
estimates  of  the  decision  function  indicate  that  families 
living  in  rural  areas,  families  who  frequently  move,  and 
families  headed  by  a  black,  female  or  someone  under  36  are 
the  least  likely  to  own.   Conversely,  the  larger  the  family 
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size  or  the  higher  the  perma'^ent  income,  the  higher  is  the 
probability  of  owning.       ■     .    ' 

The  only  discouraging  results  were  the  estimates  of 
price  elasticity.   Two  diff-grent  price  specifications  were 
tried  and  both  yielded  unsatisfactory  results.   The  specifi- 
cation used  in  chapter   IV   resulted  in  a  positive  estimate 
of  price  elasticity  for  owners.   The.  specif ication  used  in 
chapter  five  resulted  in  a  negative'^  but  elastic  (-2.59  for 
owners  and  -2.11  for  renters)  estimate  of  price  elasticity. 
This  elastic  estimate  is  not  supported-*  by  recent  studies. 
For  example,  Polinsky  (7  5)  concludes  that  housing  demand  is 
price  inelastic.   These  poor  price^  elasticity  estimates 
suggest  that  price  data  siiould  be  collected  by  the  Survey 
Research  Center's  next  "Panel  Study." 

To  test  the  hypothesis  that  simultaneity  between  the 
expenditure  and  choice  equations  does  exist,  the  two  step 
maximum • likel ihood  estimates  are  statistically  compared  to 
the  usual  OLS  estimates.   By  using  a  maximum  likelihood  ratio 
test  to  make  this  comparison,  evidence  was  found  that  simul- 
taneity does  exist.   Therefore,  the  procedures  used  in  this 
thesis  to  estimate  the  expenditure  equations  are  more  appro- 
priate than ' the  usual  OLS  approach. 

Finally,  a  few  suggestions  for  future  research  are 
wortli  making.   Firstly,  a  housing  demand  model  using  the 
entire  five  year  Survey  Research  Center  sample  could  be" 
derived  and  estimated.   This  would  not  be  a  simple  chore 
since  the  required  estimation  technique  needs  to  be  worked, 
out.   In  general,  tlie  procedures  used  in  this  thesis  would 
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not  be  appropriate  to  estima'e  a  model  based  on  pooled 
cross-section  and  time  series  data.   Secondly,  a  mod-el  that 
described  wliy  families  move  within  a  given  area  could  be 
formulated  and  estimated.   The  underpinnings  for  such  a 
model  are  presented  in  the  Appendix  of  this  thesis.   Lastly, 
the  computer  program  used  to  obtain  the  two  step  estimates 
has  many  applications.   For  example, 'it  could  be  used  to 
estimate  a  model  that  explains  electric  utilities'  demand 
for  oil  and  coal  and  why  these  utilities  "switch"  from  coal 
to  oil  and  vice  versa. 


APPENDIX  1 
DISCUSSION  OF  THE  DATA 

TJie  majority  of  the  data  were  constructed  from  the 
Survey  Research  Center  (SRC)  ^sample.  ^ The  only  exception 
was  the  construction  of  the  sundry  price  indicies.   These 
indicies  were  constructed  from  the  Bureau  of  Labor  Statis- 
tics' Handbook  of  Labor  Statistics,  1972.   For  an  explana- 
tion, of  how  these  indicies  were  formulated  see  Chapter  III, 
Section  2  and  Chapter  V,  Section  2.  ,     ' 

Tlie  original  SRC  sample  came  from  two  sources.   About 
40  percent  of  the  families  first  interviewed  in  1968  had 
been  in terviov/ed  previously  in  1966  and  1967  by  the  Bureau 
of  the  Census  as  part  of  the  Survey  of  Economic  Opportunity 
(SEO) .   The  remaining  50%  consisted  of  a  cross-section 
sample  of  dwellings  of  the  coterminous  United  States.   The 
1969  to  1972  samples  consisted  of  all  panel  members  living 
in  families  that  were  interviewed  the  previous  year  and 
newly  formed  families  containing  any  adult  panel  member 
who  had  moved  since  1968  from  a  sample  family.   The  res- 
pondent was  usually  the  head  of  the  family.. 

All  of  the  original  SEO  families  selected  in  1968  had 
incomes  in  1966  equal  to  or  below  twice  the.  federal  poverty 
line  at  that  time.   The  selection  formula  was  $2000  +  N($1000) 
where  N  is  the  number  of  individuals  in  the  family.   Excluded 
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were  families  where  the  head  vas  over  60.   Also,  except  in 

the  South  the  original  SEO  subsample  only  included  families 

who  lived  in  SMSA's.   In  summary,  the  1968  interviews  were 

taken  with  4802  families  bi'-^SRC,  1872  from  the  SEO  selection 

frame  and  2930  from  its  cross-section  samples. 

Even  though  the  SRC  sample  seems  biased  tovN^ard  p9or 

« 
families  in  1968,  by  197  0  the  data  v/fere  a  fairly  represent- 

ative  cross-section  of  families.   This  can  be  seen  in  Tables 
1  and  2,  whicli  v/ere  taken  from  The  Pangl  Study  of  Income 
Dynamics  (1972)  ,  Volume  1.   These  tables  compare  the  SRC  data 
for  1970  to  a  corresponding  national  sample  taken  at  approx- 
imately the  same  time,  the  1970  Current  Population  Survey. 
Variables  used  for  the  comparison  were  family  money  income, 
race,  sex  of  head,  size  of  family,  number  of  children  under 
eighteen  in  family,  age  of  family  head,  and  size  of  place 
of  residence.   For  example,  both  distributions  of  family 
income  show  about  the  same  percent  below  and  above  $5,000, 
though  SRC  does  have  a  slight  underrepresentation  of  very 
poor  families  (Income  under  $1,000).   The  comparisons  according 
to  the  demographic  variables  are  also  close.   There  are  a 
few  notable  exceptions.   For  black  families  SRC  does  have 
a  slight  loss  among  two  person  families  and  those  with  no 
children  under  acje  eighteen.   For  white  families  there  is  a 
slight  underrepresentation  of  those  with  heads  of  families 
forty-five  or  older.  . .   ' 

Finally,  this  thesis  only  uses  the  1971  data.   The 
sample  consisted  of  3,452  families.   Excluded  from  this 
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TABLE  I 

Comparison  of  Census  and  SRC  Data: 
Total  Family  Income  by  Race,  for  all  Families 


Total  family 
money  income 

Under  $1000 

$1000-1999 

$2000-2999 

$3000-3999 

$4000-4999 

$5000-7499 

$7500-9999 

$10,000-14,999        24.0    12.1 

$15,000  or  more       17.4     5.8 


Census 
White   Black 

3.  0. 

6.  9  ■ 

6.  4 

t 

13.0 

5.9 

"9.9 

5.8 

10.4 

5.4 

9.0 

0    33.2 


SRC 

VJhite 

Black 

1.5 

■  2.7 

5.5 

13.0 

6.1 

14.5 

5.  5 

10.0 

5.9 

9.0 

16.  5 

18.1 

15.4 

12.4 

23.  4 

14.9 

20.1 

5.4 

99.9   100.3  99.9   100.0 


63 


tablf;  2 

Comparison  of  Census  and  SRC  Data: 
Race  by  Various  Demographic  Variables, 
for  Families  with  Two  or  More  Members 

Cen'Bus  SRC 

Sex  of  family  head    White   Black  White  Black 

Male  90.0   '71.7  89.7   69.1 

Female  9.1    28.3  10.3   30.9 


100.0       100.0  '  100.0    100.0 


Size  of  family 


2  persons  35.2  28.7 

3  persons  20.9  19.9 

4  persons  19.7  16.1 

5  persons  12.6  11.7 

6  persons  6.5  9.5 

7  persons  or  more  5 . 2  14 . 1  ^ 

100.1  100. 0            100.1   99. 9 
Number  of  children 
under  18 


Uone 

42.9 

30.5 

One 

18.8 

19.8 

Two 

18.2 

17.1 

Three 

10.8 

11.4 

Four  or  more 

9.1 

21.3 

Not  ascertained 





99,9 

100.1 

Age  of  family  head 

under  2  5 

6.6 

Under  25 

9.3 

25-29 

10.5 

11.6 

30-34 

9.9 

11.8 

35-4-1 

21.2 

21.9 

45-54 

21.3 

19.6 

55-6  4 

16.  3 

15.  3 

65-74 

9.4 

6.  8 

7  5  and  over 

4.7 

3.8 

99.9 

100.1 

Size  of  place  of 

residence 

SMSA 

63.7 

72.  6 

Non-metropolitan 

36.  3 

27.4 

35.3 

22.4 

20.  3 

21.3 

19.5 

16.6 

12.0 

14.7 

■  6.7 

8.3 

6.3 

16.6 

39. 

,  8 

2-1. 

,  5 

19. 

,  3 

20. 

,5 

19. 

.  5 

19. 

.6 

10, 

,6 

12. 

,8 

10. 

,  7 

20. 

,9 

0. 

.2 

1. 

.6 

100..1       99.9 


7. 

,7 

7. 

7 

11. 

,0 

10. 

,2 

9. 

.2 

12. 

.0 

23. 

,2 

23. 

2 

20. 

,2 

21. 

,2 

15. 

.5 

12, 

.  0 

9  , 

,4 

9, 

.9 

'  3. 

,  7 

3. 

.J^ 

99. 

,  9 

99, 

.9 

6  3.3       7.5.9 
36. 6       24.1 


100.0       100.0  99.9    100.0 
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sample  were  families  v/ho  at  nny  time  in  the  five  year  inter- 
view period  lived  outside  tho  continental  U.S.   Also  excluded 
were  tliose  families  who  at  any  ,time  during  the  study  either 
both  owned  and  rented  or  di"9  not  own  or  rent.   This  second 
group  included  families  who^ received  housing  free  from  a 
friend  or  relative,  or  received  housing  as  part  of  their 
wage.   'The  final  sample  used  for  estimation  consisted  of 

.w 

3,026  families,  1,225  of  these  rented  in  1971  and  1,803  owned 
their  own  homes. 


APPENBIX  2 
EXTENSION  OF  THE  MODEL  TO  T-HE  MULTI -PERIOD  CASE  . 

In  this  section  I  will  assume  there  are  only  two  differ- 
ent  bundles  of  housing  goods.   One  of  the  bundles  can  be 
owned,  the  other  rented.   In  light  of  chapter  three,  this 
assumption  may  seem  quite  restrictive.   It  is  not.   The  re- 
sults derived  in  this  section  apply  with  equal  force  to  the 
more  general  case  of  several  housing  bundles.   The  following 
notation  will  be  used.  ' 

U   =  U(x,h. )  is  the  household's  utility  function,  where 

h,  is  the  flow  of  housing  services  received  per  unit 
of  time  when  the  house  is  owned  and  x  is  dollars  of 
expenditure  per  unit  of  time  on  all  other  goods. 
U   =  U(x,h.)  is  the  same  as  above,  except  the  housing  is 

rented, 
h,  =  h  6„(t)  is  the  flow  of  housing  services  oer  unit  of 
time  by  an  owned  dwelling  of  initial  size  h^ .   The 
depreciation  function  6  (t)  has  the  following  pro- 
perties:  "5^(0)  =  1,  6g(t)->0  as  t^™,  and  6^    <    0. 
h   =  h  (S  (t)  is  the  same  as  the  above,  but  the  initial 

size  dwelling,  h„,  is  rented. 
p„(t,li  )  =  per  unit  price  of  owned  housing  at  time  t, 

3Pg(t,hj^)  ^ 

where  ^r; =  Pn  (')  T  0- 

9hj3        B      > 

P-_(t,h,J  =  per  unit  price  of  rented  housing  at  time  t, 


where  p^  ( • )  j   0 
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Y(t)  =  family  income  receipts  at  time  t. 

Cgdij^)  =  transaction  co.sts  of  buying  a  home  of  initial 

size  hg.   These  costs  do  not  include  the  time  and 

money  costs  of  moving.   9C  (h  ) 

hi 


jj  ('i3<''3 1 1]  )  =  transaction  costs  of  selling  a  home  in 

« 
period  t.   Again,  ™^i')    does  not  include  the  time 

and  money  costs  of 'moving? 

3m^(h3.S^[t]) 


91i. 


<^3(t)m^(hj^[t])  >  0. 


B 

k  =  time  and  money  costs  of  moving.   For  any  family,  k 
is  the  same  for  all  typeS  of  moves.  That  is,  it 
makes  no  difference  if  the  family  moves  from  a 
rented  house  into  an  owned  house,  or  vice  versa. 

f (t)  =  a  probability  density  function  of  living  in  a 

given  area  for  exactly  t  years.   f(t)  is  defined 

over  the  interval  0  <  t  <  t^ ,  where  t   is  the 

—   —  m         m 

household's  time  to  retirement.   Each  family  is 
assumed  to  possess  such  a  density  function. 
For  simplicity,  I  first  assimie  that  the  family  can 
occupy  only  one  dwelling  during  their  stay  in  any  given  area. 
This  assumption  will  be  relaxed  later. 

Following  Muth  (74) ,    if  we  assume'  no  time  preference 
an    the  part  of  the  household,  its'  utility  aggregated  for  t 
years  over  the  occupancy  of  an  owned  dwelling  of  initial  size 


h^  is 


U(t)  =  /^U[x(v)  ,h_6„(v)  ]dv 
0  ti  B 
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■Apart  from  its  initial  wealth,  the  present  discounted 
value  of  the  stream  of  v;ealti-i  up  to  time  t  for  the  family 
who  buys  h^^  is  .  -% 

w(t)  =  /^[y(v)  -  x(v)  -  hgPg(v,hg)6g(v)e"^^dv  -  Cg(hg) 

-  m^(hg6g[t])e~^^- 

where  i  is  the  household's  discount  rate. 

In  selecting  an  owned  dwelling  characterized  by  initial 
size  h  ,  assume  the  family  maximizes 

(1)  E(U)  =  /^m  U(t)f  (t)dt. 

0 

The  maximization  is  subject  to  the  constraint  that  expected 
wealtli  when  the  family  moves  from  the  area  is  equa],  to  k,  or, 

(2)  E(w)  -  /^m  w(t)f(t)dt  =  Y  -  h„P„  -  X  -  T  =  k, 

0  a    a 

where: 

Y  =    /^m  [/V(v)e~^^dv]f  (t)dt, 

0       0 

P„  =  f^m    [/^p-,(vjv,)6„(v)e"^''dv]f  (t)dt, 
X  =  /'*^m  [/^x{v)e~^^dv]f  (t),dt, 

0        0 

T  =  C^(h^)  +■  /^m  [m,  (h„6^(t))e~'-^]f  (t)dt. 
B   B       0      b   B  B 

Let  L  =  E(U)  +  |iE(W),  where    is  an  undetermined  multiplier. 

The  household  selects   the  function  x (t)  and  h„  so  as  to 

maximize  L.   The  function  x(t)  must  satisfy 

f(t)f^   U  (v)dv  -  \if  {t)  f^e~^^6v   =  0     ■ 

_  0   X  0  '  •  •   ■ 

and  upon  differentiating  with  respect  to  t 

(3)  U^tt)  =  iie""-^.  ■ 
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Differentiating  L  with  respe<7t  to  h^  yields 

t 

(4)  /  ""[/  U,  f,  (v)clv]f  (t)dt 

t    ,         - 

'   -  U/  ""  [/  (Po(v;h^)6R(v) 

0       0   "  ts      "fe    ^ 

+    hgp^(v,hj^)  6p^(v))e"^^dv]f  (t)dt 
-  iHC^(hg)  4-  /^"^[r,j^(hg6j3(t))  6j^{t)e"^^]f  (t)dt}  =    0 

Dividing  by  u  we  get 

/  ""  [/   -Ty^6p(v)dv]f  (t)dt 
■  0      "     X  '  ' . 

-  /  "^  [/^  (pg(v,h3)^g(v)  +  hgP^(v,hg)6g(v))e~'-''dv]f  (t)dt 

=  C^(hg)  4-  /^m  [mj'^(h363(t))63(t)e~^^]f  (t)dt, 

U  . 
or,  substituting  g  =  _'■  ,     and  combining  terms, 

e 

(5)  /  "'{/^[-yy^  -  MC„(v)]6„(v)e~^''dv}f  (t)dt 

t  _.  , 

=  C^(hg)  +  /  '^  [mjl^  (hg6g(t))6g(t)e  '■'']f{t)dt, 

where  MC,  (v)  =  Pg(v,hj,)  +  hgPj^(v,hg),  the  marginal  cost  at 
time  V  of  buying  a  larger  house  of  initial  size  h^,.   The 
term  inside  braces  in  (5)  is  the  weighted  average  (weighted 
by  6  (v)e~"'''^)  of  the  difference  betv/een  the  marginal  rate 
of  substitution  of  housing  for  other  consumption  and  the 
relative  (to  other  goods)  marginal  cost  of  owned  housing 
services.   The  right  hand  side  of  (5)  is  the  expected  pre- 
sent discounted  marginal  transaction  cost  of  buying  (and 
selling)  h  .   The  higher  the  probability  that  a  family 
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will  move  in  the  near  future,  the  higher  are  these  trans- 
action costs.   These  transaction  costs  do  nothing  more  than 
shift  to  the  left  Chapter  Ill's    "consumption  possibilities 
frontier  for  owning."   When  (3)  and  (5)  hold,  the  family 
maximizes  (1) .  .  ' 

Let  the  optimal  solution  to  (3)  and  (5)  be  E(UJ^)  where 

t 
(6)   E(U*)  =  /  "^  U*  (t)fH;t)dt.  ^  ' 

If  the  family  chooses  to  rent,  they  maximize  (1)  subject 


to 


R 


(7)  E(VJ)  =  Y  -  h^P^  -  X  =  k 

v/here: 

t     , 

^R  "  -^o""  ^-C  Pr^'^'^'r^'^R^^^^  ^'']f(t)dt 

Again,  the  function  x(t)  must  satisfy  (3).   The  optimal  h 

must  satisfy 

t     ,   h 

(8)  /  ""  {/  [-TT^  -  MC^(v)  ]6„(v)e  ^""dvlfftldt  =  0, 

0       0    U  K        K 

where  MC  (v)  =  P  (v,h  )  +  h  p'(v,h  ),  the  marginal  cost  at 
time  V  of  renting  a  larger  apartment  of  initial  size  h  . 

Let  the  optimal  solution  to  (3)  and  (8)  be  E(UM  where 

t 

(9)  E(U*)  =  /  "^  U*  (t)f  (t)dt. 

r      0    r 

'The  family  will  choose  to  own  if  (6)  is  larger  than  (9). 
Similarly,  they  will  rent  if  (9)  is  larger  than  (6).   In 
general  this  choice  will  depend  on  U(x,li,),  U(x,h  ),  6p^t),  6  (t), 
P|^(-),  Pr(-),  y{t),  Cj,(hp),  m,^(h^^(t)),  f(t)  and  the  families 
discount  rate  i.   Ceteris  paribus,  a  higher  T  or  a  higher 
f(t)  in  early  periods  v/ill  slant  a  family's  choice  toward 
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renting.   This  is  consistent  V7ith  the  "mover"  variable 
discussion  in  Chapter  IV,     Prices  and  preferences  enter 
the  rent-ov.'n  decision  in  a  manner  similar  to  Figure  1 
in  Chapter  III.  ^ 

Up  to  nov7  I  have  assumed  that  the  family  must  occupy 
the  initial  dwelling  until  the'y  move  from  the  area  or  retire. 
More  generally,  there  will  be  some  optimal  number  of  moves. 
That  is,  it  may  be  optimal  for  the  f'ariily  to  first  occupy  a 
rented  dwelling  of  initial  size  h  (0),  and  at  some  time,  t, , 

K  I 

move  to  ah  owned  home  of  initial  size  h  (1) .   Conversely,  they 

may  want  to  own  first  and  rent  second.   If  we  assume  that  there 

are  only  a  finite  number  of  moves  possible,  then  the  family 

must  choose  moving  periods  t, ,  t„,  .  .  .  t   (t   <  t  )  and  h 

-^^         12  nn—  m 

so  as  to  maximize 

t     t_<t, 

(I)  E(U)  =    f    ^     [f         ^    U(x(v)  ,h(v)  )dv]f  (t)dt 

0        0 

t^    t,<t£t_ 
+  /    [/         U(X{v) ,h(v)dv]f {t)dt  +  .  .  .  . 
^1    ti<t<t2 

t     t  <t<^t 

+  /  "^  [/  '^    ^   U(X(v)  ,h(V)  )dv]f  (t)dt 

t     t  <t<t 
n     n   —  m 

v;here  h(v)  is  either  h.  or  h  .   The  maximization  is  subject 

to  the  constraint  that  expected  wealth  when  the  family  moves 

from  the  area  is  again  a  constant  k,  or  ' 

(II)  E(W)  =  Y  -  X  -  [h(l)P(l)  +  .  .  .  +  h(n)P(n)]  -  TTC  =  K 
where':  .  '      •  • 


t     t<t- 
h(l)P(l)  =    f    ^    If         ^-  p  (v,h(l)  )  6(v)e  ^^dv]f  (t)dt, 

0        0 


t^     t-,<t£t^ 

h(2)P(2)  =    f    ^     [f    ^  ^    p{v,h(2) )6(v)e  ^^dv]f(t)dt,  etc, 

4    4^t<4 
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TTC  =  total  transaction  costs  of  all  moves.   These 
costs  depend  on  when  the  moves  take  place  and 
the  type  of  move. 
Also,  h,  p,  6  are  either  hJ^    p  ,  6   or  h  ,  p   or  6     While 

D  ID  LJ  K        K  K 

4 

the  solution  to  (10)  and  (11)  is  not  straightforward,  in 
general  the  optimal  strategy  will  depend  on  prices,  pre- 
ferences  and  the  density  function  f (t) .   For  example,  if 
the  optimal  strategy  is  t,  =  t   (and  therefore,  t, ,  t~, 
.  .  .  t   all  equal  zero),  then  the  solution  reduces  to  (3) 
and  (5)  or  (3)  and  (8). 

In  this  section  I  have  presented  a  multi-period  model 
of  housing  demand.   It  extends  Muth's  model  in  four  important 
ways.   First,  I  have  allowed  for  two  homogeneous  bundles  of 
housing  services  rather  than  one.   Second,  Muth  assumes  the 
price  of  housing  is  constant.   In  my  model  price  is  a  func- 
tion of  quantity.   Third,  I  explicitly  distinguish  betv;een 
moving  costs  for  owners  and  renters.   Muth  does  not.   Fourth, 
I  allow  the  family  to  plan  moves  while  still  living  in  a 
given  area.   Muth  treats  moves  "v;ithin  the  area"  the  same  as 
moves  "away  from  the  area."   Therefore,  in  his  paper  each 
time  the  household  moves  prior  to  its  retirement,  it  repeats' 
the  decision  process  under  the  conditions  then  prevailing. 
In  otlier  words,  Muth  does  not  allow  for  the  optimal  moving 
strategy  implied  in  (10)  and  (11). 
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